Gene Francci3_3455 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3455 
Symbol 
ID3905695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4115137 
End bp4116393 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content71% 
IMG OID637880778 
Productputative lipoprotein 
Protein accessionYP_482538 
Protein GI86742138 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.235747 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGACGAC GGTACGGGCT CCGGTGGCTC GCCGCGGCGG TTCTGGGCGC CGTGCTGGCG 
GGAACGCTCG CCGCGTGCGG TGGCGACTCG GACGGCGAAG CCACCGGCAC GTCCTCGGCC
GGATCAACGG TCAAACTGAT GATCATCGCC CCGGTGGGCA CGTCCGGAAC GAACTACCCG
GAGATGGTGG CCGCCGCCCG GGCCGCCGTG CGCGGGGTCA ACGCCCGGGG TGGGATCGCC
GGCCACCGGG TCGAGCTCGT GCACTGCAAC GAGAAGAACG ACGCCGCCGT GGCCAGGAAG
TGCGCCCAGC AGGCGGTGGA TCAGCACGTG CTCGCGGTCG TCGGCGAGGT CAGCGGCTCG
GGTGGGATCA TGCCCATCCT GGAGAACGCG GGGATCCCGT CCATCGGTTC GGCCGGGATC
TCGGCCGACG GATCCGAGCT CAGCTCTCCG ATCAGCTACG TCATCAGCCC GCTGGCGCTC
TACCCCGCGG TCTGCCCGTC GTTGCTCGCC AAGGCGGGAG CCACCCGCCT CGGGCTGGTC
GGCTACGACC TCAGCGCCAG CGACCGGCTC ATCATCATGG CCGAGGGCGG CGCGAAGGCG
GTGAACCGTC CGATCAATCC GAAGATCCGT ATTCCGATCA CGACGAGCGA CTTCACCCCG
GCGATCTCCC AGCTCACCCG GTCCGGTGCC GACGGAGCCG TGCTCGTCGT GTTCGACCAG
GCCGCCTACG CGGTGATCTC CCAGGCCGGC CAGCGGGTCC GGACCTGCCA CGCGGCCGGA
ACCCTCTCGC CGAAGTACCT GTCGACGCTC GGTCCGGCCG CGGACAACCT CGTGGTCGCG
ACCGCGTTCC CCGAGCTGAG CCAGGCCGGC CAGTTCCCCG AGGTGGCCCG GATGATCTCC
GAACTGGACG CCGAGCAGGC CGGCGGGGAC GCGGACGCGG CCCCCGCGCT ACGCACCACG
ACGACCACCA CCGGCGCGTG GCTGTCCGTG CAGATCGCCG AGAAGGTGGG CAACAAGGTG
TCCGGCCCGC TCACCGCCCG CACCCTGCTC GACCAGCTCA ACCGGACCAC CGACCTGGAC
CTGGGCGGCA TCGTCCCGCG GCTGAACCTC ACGGCGACGA CGCCGATCCC CGGGGCGGAA
CGGATCTTCA ACACCACGCT GCGGGGAGCC CGCTGGGACA GCGCGTCGAA GAGCTTCGTG
CCACTCGGCC CGGAGACCTA CTCCGGCCTG GACATCCTGC GCCGGGCCGC CTCCTGA
 
Protein sequence
MGRRYGLRWL AAAVLGAVLA GTLAACGGDS DGEATGTSSA GSTVKLMIIA PVGTSGTNYP 
EMVAAARAAV RGVNARGGIA GHRVELVHCN EKNDAAVARK CAQQAVDQHV LAVVGEVSGS
GGIMPILENA GIPSIGSAGI SADGSELSSP ISYVISPLAL YPAVCPSLLA KAGATRLGLV
GYDLSASDRL IIMAEGGAKA VNRPINPKIR IPITTSDFTP AISQLTRSGA DGAVLVVFDQ
AAYAVISQAG QRVRTCHAAG TLSPKYLSTL GPAADNLVVA TAFPELSQAG QFPEVARMIS
ELDAEQAGGD ADAAPALRTT TTTTGAWLSV QIAEKVGNKV SGPLTARTLL DQLNRTTDLD
LGGIVPRLNL TATTPIPGAE RIFNTTLRGA RWDSASKSFV PLGPETYSGL DILRRAAS