Gene Franean1_0451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0451 
Symbol 
ID5668873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp531590 
End bp533383 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content66% 
IMG OID641239383 
ProductABC transporter related 
Protein accessionYP_001504821 
Protein GI158312313 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component
[COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCACGC CGGCATCAGG GACCGACACC ACCAAGCGCA GGTCGACGCT GCGCCGACTA 
TGCAAGGACC CGCAGGCAGT CGTCACGTCG AGTGTCCTGC TGATCATCGC ACTGCTCGGC
GCGCTGGCTC CACTACTCAC GTCGCACGGC CCGAACGAAG CCTCGCTGGA TGCGATCGAC
GCCGACGTCA GCACCCCGGG CTACCCACTC GGTGGTGATC AGAGCGGTCG GGACATCTTT
GCGCGACTGC TCGGTTCCAT CAATACGAGC ATGGTCTCCG CCCTGATCGG AACGACTGTC
GCATTGACGA TCGGCGTCAC TGGTGGTCTC ATCGGCGGCT ACTTCGGGCA CCGGATCAGG
GGCTTCACCG AATGGGTGTT CAACCTGGTG ATGACCTTCC CGGGCCTACT GCTGTTGATC
ATTCTTATGC CTGTGACCAA AGGTGACTAC CGCATCACGA TGATGATCTT CGGGGTGTTG
CTGTCGCCCG GTGTCTACCG GATCGTGCGC AATCAGGTTC TGAGTGTGCG CAACGAGTTG
TTTGTCGACG CCGCGCGTGT CTCCGGGCTG TCCAGCCCGA GGATCCTCCG GCGGCACATC
CTCGCCGCGG TCCGCGGTCC CGTCATCATC GCCACCGCAT TCCTCTTAGG AGCCTCCATC
GGAGTGCAGG CGGGCTTGGC CTTCCTGGGC GTCGGACGGA GCTCAGTGCC CAGTTTCGGC
TCCATGATCG CCGCCGGGTT CGAGAGTCTG TACGTCGAAC CACTGCAAAT CGTGTGGCCC
AGCGTCATGC TCGGCCTCAT CACGGCCTCG CTGGTTCTGT TCGGCAACGC GGTGCGGGAC
ACGCTAGGAG CGTCGCGGCG CCGACCGCGC AAGATCCACC CATCCGCGCC GGCCCGGCCG
GAGAAACCCG ACCCGTCCGA CGATGCACCT CCCGGACTCC TGGAGGTGCG GAATTTGACG
ATCGCCTACC CCTCCCCGTC CGGCGAACCG CGCGAGGTGG TCCGTGGGGT CACCCTGAGC
CTGCAGCCGG GTGAAACCCT CGGCGTCGTC GGCGAGTCCA GCGCGGGCAA GACCCAGATC
GCTTTCGCTG CCCTCGGCGT GCTGCCGCCG GGTGCGACGA TCACCTGCGG GTCCATCACG
TTCGACGGCC GCGAACTGCT CGGGCGGACC GAGCGTGAGC TGCGCGGTAT CCGCGGCAGA
TCCATCGCGT ACGTGCCGCA GGAGCCGATG TCGAACCTCG ATCCCTGCAC CACCGTGGGG
GCGCAGCTCG TCGAGGGTGT ACAGGCGTCC ACGCCGATGG CGCGGCGCGC CGCCCGCGAG
CGGGTGCTCG CCCTGCTGGG CCGTGTCGGC ATCCCGGATC CGAAACGCAC CTTCAACTCC
TATCCACACC AAATCTCCGG TGGCATGGCG CAGCGCGTGC TCATCGCCGG TGCCGTGGCC
AGCCGACCTC GGATACTCAT CGCCGACGAG CCGACGACCG CGCTCGATGT CACGATCCAA
GCGGACATCC TCGACCTGCT GCGGGACCTC CAACAGGAAC TGAACATGGC CGTCTTGCTC
GTGACACACA ACTTCGGCGT GGTGGCCGAC CTGTGCGACC GCGTCGCCGT CATGCGAAGA
GGCGAGATCG TCGAGATGGG GACCGCGCTC GACATCCTTG GCGAACCGCA ACACGAGTAC
ACCCGGATGC TGCTCGCCTC CATCCTCGAT GGGAGCACAA CACGCACCGA CGCGCCGAGC
GGGGAAATAT CCGCCCGCGA CAGCGTGGCG GTGCAGGTCG GAGAAGCGCG ATGA
 
Protein sequence
MTTPASGTDT TKRRSTLRRL CKDPQAVVTS SVLLIIALLG ALAPLLTSHG PNEASLDAID 
ADVSTPGYPL GGDQSGRDIF ARLLGSINTS MVSALIGTTV ALTIGVTGGL IGGYFGHRIR
GFTEWVFNLV MTFPGLLLLI ILMPVTKGDY RITMMIFGVL LSPGVYRIVR NQVLSVRNEL
FVDAARVSGL SSPRILRRHI LAAVRGPVII ATAFLLGASI GVQAGLAFLG VGRSSVPSFG
SMIAAGFESL YVEPLQIVWP SVMLGLITAS LVLFGNAVRD TLGASRRRPR KIHPSAPARP
EKPDPSDDAP PGLLEVRNLT IAYPSPSGEP REVVRGVTLS LQPGETLGVV GESSAGKTQI
AFAALGVLPP GATITCGSIT FDGRELLGRT ERELRGIRGR SIAYVPQEPM SNLDPCTTVG
AQLVEGVQAS TPMARRAARE RVLALLGRVG IPDPKRTFNS YPHQISGGMA QRVLIAGAVA
SRPRILIADE PTTALDVTIQ ADILDLLRDL QQELNMAVLL VTHNFGVVAD LCDRVAVMRR
GEIVEMGTAL DILGEPQHEY TRMLLASILD GSTTRTDAPS GEISARDSVA VQVGEAR