Gene Francci3_0101 details


Gene Information

Locus tag: Francci3_0101
Symbol:
ID: 3902935
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 123143
End bp: 124507
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 71%
IMG OID: 637877432
Product: hypothetical protein
Protein accession: YP_479224
Protein GI: 86738824
COG category:
COG ID:
TIGRFAM ID:
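The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check. It assumes 1-based inclusive coordinates and a CDS that ends with a single stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return length_bp // 3 - 1


length = gene_length_bp(123143, 124507)
print(length)                     # 1365
print(protein_length_aa(length))  # 454
```

Both results agree with the Gene Length and Protein Length values listed in the record.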


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.39004
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTCCTG GGGTGTGGCG TCCACCGGTG GAGCCCTCGG CTGCGGAACA AACAGTGATC 
AGGCTGGTGC GGCGGGCGAA GCTGTTCGTG TTCCTGCGCC GGTATCGGCA TGAGCTGTTC
GACGAGGCGT TCCAGACCGA GCTGGCCGAG GTCTACCGCG ACAGTCCGAA GGGCCAGCCG
CCGGTGCCGC CGGCGCAGCT GGCGTTGGCG CTGATCCTGC AGGCCTACAC GGGTATCTCC
GACGACGAGG TGATCGAGGC GACGGTGATG GACCGGCGCT GGCAGCTGGT GCTGGACTGC
CTGGACACCG ACCGTCCCCC GTTTGCCAAG GGCACCCTGG TGGGATTCCG TACCCGGCTG
ATCAACGCTG GGTGGGATCG GCGGCTGATC GCACGCACCA TCGAGATCGC CATGGTGAGC
GGGGGGTTCG GGCCGCGGGC GTTACGGGCG GCGTTGGACT CCAGTCCGTT GTGGGGCGCC
GGCCGGGTCG AGGACACCCC GAACATGGGA CACGCCCTGC GCAAGGCTCT GGGTGTGATC
GCGGCCGGGC AGGGGTGGGG GCTGGACGAA GGAACCGCCG TCCTGGCCCG CCGGGCTGGG
GCCCCGGTGC TGGCCGGATC CAGCCTGAAA GCCGCGTTGG ACGCCGACTG GGACGACCCC
GGCGAGCGGG ACCACGCTCT GGCGGTGGTG CTGGCCGCCC TGGAGGCGGT GGAGACCTTC
ATCGCTGCCC AGCCCGCCCC GGTCGGGGCG GCTGTCGCGG TGGCCCGCCA GGTCCGTGAC
CAGGACGTCG AGACCACCCC GGCCGGTGTG GCCCGGCTGC GCCGCGGGGT AGCGAAAAAC
CTGACCGAGC TGCACATCGA CCGGGCCTAC CTGCCCTCCA CTCTGGTCCG CGACCGCGAC
GAGAACCTGC AGGTGTTCTG TAAGGCGTGG CGAGTCCGCA ACACCACCGG CCGGTATGAC
GAGACCGCTT TCACCCTGAA CTTCGACCAC GGCCAGCTCA CCTGCCCGGC CGGGGTGGTC
ATGCCCTTCA CCCCGGGCCG CACCGTCCGT TTCCCAGCAG CGACCTGCGC GGCCTGTCCC
CTGCGGGAAC AGTGCACCAC CAGCACCCGC GGGCGCAGCG TGAGCATCCA TCCCGACGAG
ACTCTGCTCG CCGAACTCCG TGAACGCCAG GCGACCCCGG CCGGCCGCGC CCGCCTGCGG
GAACGCACCG CGGTCGAGCA CACCCTCGCC CACGTCGGCC ACTGGCAGGG CCGCCGCGCC
CGCTACCACG GGCAGCGCAA GAACCTGTTC TACCTCCGCC GCACCGCCGT CGTCCACAAC
CTCCACGTCA TCGCCCGCCA AAGAAACGAT CAGCAAGCAG CCTGA
 
Protein sequence
MRPGVWRPPV EPSAAEQTVI RLVRRAKLFV FLRRYRHELF DEAFQTELAE VYRDSPKGQP 
PVPPAQLALA LILQAYTGIS DDEVIEATVM DRRWQLVLDC LDTDRPPFAK GTLVGFRTRL
INAGWDRRLI ARTIEIAMVS GGFGPRALRA ALDSSPLWGA GRVEDTPNMG HALRKALGVI
AAGQGWGLDE GTAVLARRAG APVLAGSSLK AALDADWDDP GERDHALAVV LAALEAVETF
IAAQPAPVGA AVAVARQVRD QDVETTPAGV ARLRRGVAKN LTELHIDRAY LPSTLVRDRD
ENLQVFCKAW RVRNTTGRYD ETAFTLNFDH GQLTCPAGVV MPFTPGRTVR FPAATCAACP
LREQCTTSTR GRSVSIHPDE TLLAELRERQ ATPAGRARLR ERTAVEHTLA HVGHWQGRRA
RYHGQRKNLF YLRRTAVVHN LHVIARQRND QQAA