Gene Franean1_3221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3221 
Symbol 
ID5671597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3802106 
End bp3803347 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content65% 
IMG OID641242115 
ProductABC-type branched-chain amino acid transport systems periplasmic component-like protein 
Protein accessionYP_001507535 
Protein GI158315027 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCCAGA GACGATTAAG ATCATTTGTC GCGATGGCGG GTGCCGTTGC GGTGCTGGTG 
GCGGCTGCCG GCTGCGGCGG CTCGTCGTCG GACGGGAGTG GTGGTGATGC GGAGGCGCAG
GGTGGCAAGA CCTACACGAT CGGGGTCTTA GCCGATATCA CGGGCCCGGC GGCGTCCGGG
AACGAGACCA GCGTCGAGGG CGTCAAGGCG GGGACGTACT ACGCCGAGCG CGAGGGAATC
AAGATCAAGT ACATCGTGGC CGACACGGCG ACGAATCCGA CGACCGCGCT CTCGGCCGCG
CAGAAGCTGG TCACGCAGGA TCACGTGTTC GCCGTGATCG CCCACTCGGC GATCACGTTC
TCCGCGGCTT CCTACCTCAC CGCTCAGAAG GTCCCGGTCA TCGGTTTCGC CCAGGACGGC
CGAGAGTGGT TCACGTCCCC GAACATGTTC TCGATCACCG GGCCGACGAT CGACAAAGAA
GTCACGACGA CGATGGGCGA GTTCTTCAAG TCGAAGGGGG CGACCAGCAT CGCCTCGATC
GGTTACTCGG TCTCGCCCCA GTCGCAGGCC TCGGCGCTCG AGACGGCGGA GTCGGCCAGA
CTCGCGGGTG TCAAAATAGG CTACGTCAAC GCGCAGCTCC CGTTCGGTAG CACCGATGTC
GGTCCGACAG TGCTGGCCAT GAAGGAAGCC AAGATCGATT CCTTCTTCGC CGCGGTCGAC
CCGAACACCG CCTTCGCTCT CATCTCCGGC CTGGAACAGC AGGGCGTGGA CATCAAGGTG
GCGCTGCTGC CCACCGGCTA TGGCGGTGAC CTGGCGCAGG CTGGCCCGGG CGCGCGGCGA
GCGGCTCAGG GTGTCTACTT CTCCCTCGGA TACCAGCCCG TCGAGATGCA GACAGCCGCT
ACCAAGCAGT TCCAGAGCGA CCTGAAAGAA GCGGGGATCA CCGGGGCGCC GACGCTCGCG
CATTACAACG GGTACATCTC GGTCGGTCTG CTCGTCCGGG CTCTCAAGGC GGCTGGCGCG
GATCCGACGC CGGCATCGCT CACCAAGGCG CTCGCCGGAA TCCATGACTG GGACGGCCTC
GGCCTCTACG GGGCCACGAA GTACGACCTC AGCCAGAAGA AGATCTCGAC CGGCGAGTGC
CTGTTCATGA GCAGACTGGA CGGCAGCACG TTCAAGCCCG TCCCCGACGC TATCCCCATC
TGCGGCGACC TGACCGACGA GAAGGTCACG CTCTCGTCCT GA
 
Protein sequence
MIQRRLRSFV AMAGAVAVLV AAAGCGGSSS DGSGGDAEAQ GGKTYTIGVL ADITGPAASG 
NETSVEGVKA GTYYAEREGI KIKYIVADTA TNPTTALSAA QKLVTQDHVF AVIAHSAITF
SAASYLTAQK VPVIGFAQDG REWFTSPNMF SITGPTIDKE VTTTMGEFFK SKGATSIASI
GYSVSPQSQA SALETAESAR LAGVKIGYVN AQLPFGSTDV GPTVLAMKEA KIDSFFAAVD
PNTAFALISG LEQQGVDIKV ALLPTGYGGD LAQAGPGARR AAQGVYFSLG YQPVEMQTAA
TKQFQSDLKE AGITGAPTLA HYNGYISVGL LVRALKAAGA DPTPASLTKA LAGIHDWDGL
GLYGATKYDL SQKKISTGEC LFMSRLDGST FKPVPDAIPI CGDLTDEKVT LSS