Gene Franean1_4614 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4614 
Symbol 
ID5672959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5499574 
End bp5500770 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content71% 
IMG OID641243475 
ProductABC transporter related 
Protein accessionYP_001508891 
Protein GI158316383 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACA TCGTGCTCGA CCACGTCACC AAGCGGTTCC CGGACGGCCG GCTCGCGGTG 
GACGACGCCA GCCTCTCCAT CGCGGACGGC GAGTTCGTCA TCCTCGTCGG CCCGTCCGGG
TGCGGGAAGT CGACCACCCT GAACATGATC GCCGGCCTGG AGGACATCTC CTCCGGCGAG
CTGCGGATCG GCGGCAAGGT GGTCAACGAC CTGGCGCCGA AGGACCGCGA CATCGCCATG
GTGTTCCAGA GCTACGCCCT GTACCCGCAC ATGTCGGTCC GGAAGAACAT GGGCTTCGCC
CTGTCGCTGG CGAAACGACC GAAGGAGGAG ATCGACCGGT TGGTCGAGGA GGCCGCCCGG
GTCCTCGACC TCACCGAGCA CCTGGACCGC AAGCCGGCCC AGCTCTCCGG TGGGCAGCGG
CAGCGGGTCG CGATGGGCCG CGCGATCGTC CGCTCCCCGA AGGCGTTCCT GATGGACGAG
CCGCTGTCCA ACCTGGACGC CAAGCTGCGG GTGCAGATGC GCACCGAGGT GTCCCGCATC
CAGAACCGAC TCGGCACGAC CATGGTGTAC GTCACCCACG ACCAGACCGA GGCGATGACG
CTGGGCGACC GGGTGGCGGT GCTGCGGTCC GGGCGGATCC AGCAGGTCGG CACGCCGACC
GAGCTGTACG CCCGGCCGGC GACGGTGTTC GTGGCCGGTT TCATCGGCTC ACCCGCGATG
AACTTCGTGC CGGCGACCCT CGAGGAGGGC GAGCTGCGCA CCCCCCTCGG GACGATCGTC
CCGGACGAGC GGCAGCGGCG GCTGCTGGAG GGCTGGAACG GGACGGCGGC CGACCGTCGG
TCGCTGATCG TCGGGGTGCG GCCCGAGCAC TTCGAGGACG CGGCGCTGGA GCCGGCGAAG
GACCGCCCGG GACTGCGGTT CACGGTCACG GTGGACGTGC TGGAGCGGCT CGGCTCGGAC
AGCTTCGCCT ACTTCACCCT GGCGGGCGGG CGCGCCCGGA CCGCCGACCT GGAGGAGCTG
GCCCACGACG CGGGCACGGT CGAGCTGTCC GGCAAGGCGG AACAGGTGGT CGCCCGGCTC
GACGCGGCCA GCCGGATCCG GGAGGGCGAG AAGGCCGAGC TGTGGCTGGA CGTCCACCAG
CTGCACCTGT TCGACCCCGA CACCGGCCGC AACCTGGACG CCCCGGCCAC GGCATGA
 
Protein sequence
MADIVLDHVT KRFPDGRLAV DDASLSIADG EFVILVGPSG CGKSTTLNMI AGLEDISSGE 
LRIGGKVVND LAPKDRDIAM VFQSYALYPH MSVRKNMGFA LSLAKRPKEE IDRLVEEAAR
VLDLTEHLDR KPAQLSGGQR QRVAMGRAIV RSPKAFLMDE PLSNLDAKLR VQMRTEVSRI
QNRLGTTMVY VTHDQTEAMT LGDRVAVLRS GRIQQVGTPT ELYARPATVF VAGFIGSPAM
NFVPATLEEG ELRTPLGTIV PDERQRRLLE GWNGTAADRR SLIVGVRPEH FEDAALEPAK
DRPGLRFTVT VDVLERLGSD SFAYFTLAGG RARTADLEEL AHDAGTVELS GKAEQVVARL
DAASRIREGE KAELWLDVHQ LHLFDPDTGR NLDAPATA