Gene Francci3_0071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0071 
Symbol 
ID3905406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp89381 
End bp90892 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content73% 
IMG OID637877401 
Producthypothetical protein 
Protein accessionYP_479194 
Protein GI86738794 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCCGT ACCAACCAGC GGGGCCGGAT CCGCGGCCTA GCGGCTACGG CCCTCCGACC 
GGGCCATATG ACGCCGCGGG CCCGGTGACC GGGCCCTACC ACCCGGTACC GGCGAGCCAT
GGCGATCCCG GCCATGGCGA TCCCGGCCAT GGCAGTCCAC GTCAGGGCAG TTCGATACAG
CCCGAGGATA TCGATTCCCC GTCCACCGGC ACCTCGATGC GCCGGGCGCT CGTCCCGCCC
GCTCGGACCG CTCGGACCGC TCGGGCTGAG AGCCCGCGGT CCCGAGCCGT TCCTGGTTCC
ACTCCCACTC CCGCACCCGC ACCCGGTTCC ACTCCCGGGC GTCGACCACC CGGATGGGCT
ACCGATGGAA GGTCGGCTCC ACGAGGTGGA CCGGATCGGC AGAATGAGGT CGGTCTGCCG
CGGGCCAGAA GGGCGATGCC GACACGTCGG GCCGACGGTG CCTCCACAGG CGGTCCGCCG
ACCGGCGGCG CCGGGATGAG AGACGGTGGA ACAGGCCCGC GGATGCCCTA CGGCAGGGAT
GCCGGCGACG ACCTCGGACT CGACCGACGC CATGCCGACC GCACCGATCC TGATCGGCCC
GGTCGGCCCG GTCGGCCCGG TCGGCCCGGT CGGCCCGGTC GGCCCGGTCG GCCCGAGGCT
TACCCGCCGG GCGCGGGGCG GAGCCGGCCG AGTCCGGCGC GGTTCGGAGC CGATCAGCGG
GTCGCGGCGC GTCGCTCGGG GAGCCTGGCC TCCGCCGCGG GCACAGCCGA TTTCCCCGAC
GATGACGACG CTCGCGATTC CCGACAGGGA GATCGTGATT CCGACACCAT GCCTTTCCTG
CATCGGGTCG GGGTGGCCCT GGTGGTGCTG GCCGTGGCAC TCGGGGTGGG CGTCGGGGCC
GGTGCCGTCT GGGAAAAGGT CCGGCCCAGC GGCCGGACCG CCAATGCGGC CCCGGCGCCG
ACGGCAGCAT CGTCCGGCGG CCCCGCAGCG GCCGCGCCCA GCCCAAGCAC AGGGGCGGCT
GGCGGTCAGG CAGCTGGCGG TCAGGCGGCT GCGGGCCAGA TCGCGGTGCC GGCGGACTGG
ACCTCGTTCA CCGACACTGT GCAGAAGGCC ACCTTCTCCC ACCCGCCCGT GTGGAAGCAG
CGGCGCGACA ACACCGGCAT CTTCTACGGC GAGCCCGGCA CCGTCTCGGA GTACGGACCG
CAGATGATCG GGGTGGCCCG GGTCGCGGTG CAGGATCCGG TGGCAGCGCT CACGCAGGTC
CAGTCCGCCG AGTTCAACAC GGTCCCCGGT CTGACCAGGG ACCATTCCGG TCCGGCGACG
GACACCAGCG ATCAGCCCAC CCAGGAACTT GCCGGCTCGT ACGACCGGGA GGGGCAGCGG
GTCTCCTACC TCATGCGCAC GGTGTCGGTG GCCGGTGCCG TGTACGTGCT CATCGCGCGA
GTGTCGACCA ACGTCCTGGC GTCGCTCAAC ACGCTGATGG GCGCACTGCG GTCGTCGTTC
GCGCCGGCCT GA
 
Protein sequence
MEPYQPAGPD PRPSGYGPPT GPYDAAGPVT GPYHPVPASH GDPGHGDPGH GSPRQGSSIQ 
PEDIDSPSTG TSMRRALVPP ARTARTARAE SPRSRAVPGS TPTPAPAPGS TPGRRPPGWA
TDGRSAPRGG PDRQNEVGLP RARRAMPTRR ADGASTGGPP TGGAGMRDGG TGPRMPYGRD
AGDDLGLDRR HADRTDPDRP GRPGRPGRPG RPGRPGRPEA YPPGAGRSRP SPARFGADQR
VAARRSGSLA SAAGTADFPD DDDARDSRQG DRDSDTMPFL HRVGVALVVL AVALGVGVGA
GAVWEKVRPS GRTANAAPAP TAASSGGPAA AAPSPSTGAA GGQAAGGQAA AGQIAVPADW
TSFTDTVQKA TFSHPPVWKQ RRDNTGIFYG EPGTVSEYGP QMIGVARVAV QDPVAALTQV
QSAEFNTVPG LTRDHSGPAT DTSDQPTQEL AGSYDREGQR VSYLMRTVSV AGAVYVLIAR
VSTNVLASLN TLMGALRSSF APA