Gene Franean1_2697 details


Gene Information

Locus tag: Franean1_2697
Symbol:
ID: 5671088
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3191499
End bp: 3192797
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 68%
IMG OID: 641241609
Product: putative branched-chain amino acid ABC transporter, amino acid-binding protein
Protein accession: YP_001507029
Protein GI: 158314521
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
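
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length; the record itself does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the record.
start_bp = 3191499
end_bp = 3192797

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with a stop codon,
# which is not translated into an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1299
print(remainder)       # 0
print(protein_length)  # 432
```

Under these assumptions the computed values agree with the Gene Length (1299 bp) and Protein Length (432 aa) fields.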


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.614277
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATGACA CGGTAGTCCT ACACGGTCAT TCGACGGCGA GGCGGCACCG CCGACGCTCG 
GCGGGCATGC TCTTATCGGC GTCCCTGGCG CTGCTCGTGT TAGCCGCGGC GTGCGGTTCG
GATGACGGCG GGAGCACGCC CACCACGGTG GATTCATCCG CGGCCGCCGA CGCGCTCGGT
CCGGTCAGGA AGGCGGCGGG AACCCCCGTC AAGATCGGTA TCGTCTCGGA CGGCAGGTCC
GCCGCGATCG ACAACTCGGT GCAGTTCGCG GTAGCGAAGG CGACCGCGAA ATACCTGAAC
GAGCATCGCG GGGGGATCGG TGGTCGGCCC GTCGAGCTTG TGACGTGTGA GACGCAGGCG
GACCCGGCCA AGGGCACGGA CTGCGGCAAC CAGATGGTCG AGAAGGACGT CGTCGCGGTC
GCGGTCAGTG AGTCGGCGGT CGGTGACAGC GTCTGGCAGC CGCTGGCCGA CGCTGATGTG
CCGGCGATGT TATACAGCGC GACCAGCCCG ACGGCCCTCA CCGACCCGAC GACCTTCACG
GTGACCGATC CGAGTTTCAC GATTCAGCAG CTGCCGATCG CCCTCGCGAA GGAGAAGAAG
CTCAAGAAGG TGACGTTGGT GGCCATCGAC GTGCCCGCCT TGCTCTACAG CGTCCAGGAG
GTCGTGCCGA AGCAGATGGC GAAGGCAGGG CTCGACTACC AGCTCATCCG TATCCCACCG
GGCACAGCCG ACATGACGCC GCAGCTGCAG GGTGTCGCGG GCGGTGACCC AGGGTTGGTG
TTCGTCATCG GCAACGACTC GTTCTGTATC AGCGCCTTCA ACGGCCTGAG GGCGGTCGGG
TACGACGGCA GCATCGGCGC GATCTCGCAG TGCATCACCG ACGCGACCCG CAAGGCGGTG
CCGGGCGACG TGCTGGACGG CATGAGCGTC GCCGCCTCGA TGCCGGCCGG CGGGGATGAC
CCGTCCAGCG TCCTCTACAA CGCCGTGCTC GAGACCTACG GCAAGGACAT CGATGCCAGT
TCGTCCACCG GTCGGGGCAT GTTCGCCACC TTCGCGGGCC TCGCGGCAGC GCTTGAGGGC
ATCAAGGGCG ACGTCACCCC GGCGACGGCC GTGGCCGCCA TCAGATCGAT GCCGGAGAAG
GAGCTGCCGG GCGCGGGCGG GCTGAAGTTC CGCTGCAACG GCAAGGCCAA CCCCGAGACG
CCCGCGGTGT GCGTGCGGGG CGGACTGACG GCGAGCCTCG ACAGCGACGG CCAGGCCACC
GACTTCAACG TGGTCGGGAG CTCTCCGATC CCGGACTGA
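
The record reports a GC content of 68% for this gene. A minimal sketch of how such a figure could be recomputed from the gene sequence follows, assuming GC content means the fraction of G and C bases among all bases; how the database rounds the value is not stated. The function name `gc_content` is hypothetical, and the example call uses only the first 10-base block of the sequence rather than the full gene.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base block spacing and line breaks used in this
    record) is removed before counting.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First 10-base block of the gene sequence: 2 G + 1 C out of 10 bases.
print(gc_content("ATGTATGACA"))  # 0.3
```

Passing the full gene sequence, with spaces and line breaks left in place, gives the value to compare against the reported 68%.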
 
Protein sequence
MYDTVVLHGH STARRHRRRS AGMLLSASLA LLVLAAACGS DDGGSTPTTV DSSAAADALG 
PVRKAAGTPV KIGIVSDGRS AAIDNSVQFA VAKATAKYLN EHRGGIGGRP VELVTCETQA
DPAKGTDCGN QMVEKDVVAV AVSESAVGDS VWQPLADADV PAMLYSATSP TALTDPTTFT
VTDPSFTIQQ LPIALAKEKK LKKVTLVAID VPALLYSVQE VVPKQMAKAG LDYQLIRIPP
GTADMTPQLQ GVAGGDPGLV FVIGNDSFCI SAFNGLRAVG YDGSIGAISQ CITDATRKAV
PGDVLDGMSV AASMPAGGDD PSSVLYNAVL ETYGKDIDAS SSTGRGMFAT FAGLAAALEG
IKGDVTPATA VAAIRSMPEK ELPGAGGLKF RCNGKANPET PAVCVRGGLT ASLDSDGQAT
DFNVVGSSPI PD
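
The protein sequence corresponds to a translation of the gene sequence with translation table 11. A minimal, dependency-free sketch of that translation follows. It relies on the fact that NCBI table 11 assigns the same amino acid to every codon as the standard code (the two differ only in their permitted start codons), and it assumes the sequence is given 5' to 3' in the coding frame. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence (60 bases, 20 codons).
first_line = "ATGTATGACA CGGTAGTCCT ACACGGTCAT TCGACGGCGA GGCGGCACCG CCGACGCTCG"
print(translate(first_line))  # MYDTVVLHGHSTARRHRRRS
```

The output matches the first 20 residues of the protein sequence, and the final three codons of the gene (CCG GAC TGA) translate to P, D, and stop, consistent with the protein ending in PD.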