Gene Franean1_0187 details

Gene Information

Locus tag: Franean1_0187
Symbol:
ID: 5668612
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 227518
End bp: 229479
Gene Length: 1962 bp
Protein Length: 653 aa
Translation table: 11
GC content: 74%
IMG OID: 641239116
Product: TrkA domain-containing protein
Protein accession: YP_001504560
Protein GI: 158312052
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0569] K+ transport systems, NAD-binding component
TIGRFAM ID:
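
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length counts the terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Number of residues, assuming the CDS length includes one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(227518, 229479)
print(length)                     # 1962
print(protein_length_aa(length))  # 653
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.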


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.228831
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.390007
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGCGGG CGGCGCGGCT CTACGCGACG CTGCTGTTCG GCCGGATGGG GCGGATCGGC 
CGATGGCTGC GCCGCCACCT GCTCGGCACG TTCCTCGTGC TCGGGCTGAT CGCCCTCGTG
CTGAGCCTCA TCGGCTCGTA CCAGCACTTC TCCGCGGAGC CCGAGACGTT CACCTGGGCG
AACGTGATCT TCTTCGCCTC GACGCTCTTC CTCGCCGACG GCACGATGTT CGAGAACGGC
GGGCAGTTCC CCCCGGCGCT GGAGATCGCC CGTTTCCTCG CCCCGCTGGC GACGGCCGTC
GGTGTCGCCG ACGCCGCCAG CACCCTGTTC GCGCACCGGT TCGAACGGTT CCGCGCCCGG
CACGCGCACC GGCATGTCAT CGTCTGCGGC ACCGGGCCGA CCGCCTCCGC GCTCGTGGAC
AAGCTCGTCG GGAACAGCCG GGTCGTGCTC GTCGCCGAGG ACGCCGAGCG GGAGTATCCC
GACGCGGAGC GGCCCCCGAA CCTGCTGCGG GTCATCGGTG ACCCGGTGGA GCGGCTCGTC
CTGGCCAGCG CGGGGATCAT GCGGGCCGAC GTCGTCTACA GCTGCCTGGA CGACACCGCC
TCCAACCTCG CCGTCGCGCT CACCGCCCGT GGCGTCGTGC GGACCAGCCG GGCCGGCAGC
CGCCGGATCT CCTCCGCCCG GCTGGACCAT CCGCTGCGCT GCCTGGCCCA GGTGGGCGAC
CTCTCGCTGC TGCCGCACCT GCGGGCCCGC CGCATCGGCC TGGAGAACGA CCCTGGCTTC
CGCCTCGACT TCTTCGCGGT GGAGGTGCTC GGCGCGCACG CCATGCTGAA CGGCAACGCG
CCCGCCTGGG CCCGGCCGGA CGAGTTTCCC GACCTGCGGG CGCGCCCGGC GCCGGTCGTC
GTTCTCGGGC TGTCCGACCT GGGCCAGGCC GTGGTGATGG AGCTGGCGCG GCGCTGGCGC
GACTACAGCT CGCCCGGCTC GCCACCGCTG CGGATCGTCC TGTCGGGGCG GAACGCGACC
GTCGAGGCCG CGGCCATGCG CTCGCGGGAG CCGGCGCTGG CCCGGGTCGA GCTGTTGGCC
AACGACAGTC CGTCCGGGGA GCTGCCCGTG GCGGTCCTCG AGCTCGACGA CCCGGTCGAG
GGCGCCCGGA CCGTCCCGCC GGAGTTCGTC TACGTCTGCC ACGGCGACGA GGAGGAGGCG
CTGCTGCGCG GGCTGGAGGT GGCGCGCACG CTGGGCGCCG CCAGCCTCGG CGAGCATGGC
ACCCGGGTCG TCGTGCGCAC CGGCCGGCAG CGCAGCTTCG AGGACGTCTT CGGCCCGCGC
GAGGCGCCCG ACCCGGACGG CCCCGACGAC GACCCGGACG CGCCCGTCCC GCCGCCGCCC
GGCCCGGGGC GGCGTGGCGG CGGTGCGCTG CTGGACGACG TCCAGGGCGG GTTGCGGTTC
TTCGCGGTCA ACGACGAGGC GCTGCCCCTC GATCCCGGGG CGAACGACCT CATCGAGCGC
TTCGCGCGGA TCATCCACGA GAAGTACCTG TTCAAGGAGA TGACCGGCGG CGCGGTCCTG
CGTTCGCGGC GGAGCCTGCG CCCCTGGGAC GAGCTGGACG ACGATCTGCG GGCCGCCAAC
CTGGCGCAGG CGATCGGGTA CAGCGACGTC CTGCGCCGGC GGAACTGGAT GCTGATGCCG
GCCGGGGAGC ACGACCCGGA GTTCGTCTTC ACGCCGGAGG AGCTGGAGGA GCTCGCCCAG
GCGGAGCATG CCCGCTGGCG GCGCGAGCGG GAGAGCCGCG GGGTGCGGTA CGGGCCGGTG
GAGCGGGGCG GCTCCGATCC GCGTCACCCG TCCCTCGTCG ACTGGGAGCA GCTCTCCCCG
GCGGACCAGG ACCGCGACCG CGACGTGGTG CGCAATATGC CCCAGGTGCT GGCCACAGCC
GGCCTGCGCA TCGTGCGGAT GACACCGCGC CCACCCGACT GA
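
The gene sequence is printed in blocks of 10 bases, so it has to be joined before any computation. As a minimal sketch, the helpers below strip the block formatting and compute the G+C fraction of a sequence. The function names are hypothetical, and it is an assumption that the GC content field of the record was derived the same way over the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and newlines from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence shown above.
first_line = "GTGGCGCGGG CGGCGCGGCT CTACGCGACG CTGCTGTTCG GCCGGATGGG GCGGATCGGC"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_fraction(first_line), 3))
```

Passing the whole gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.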
 
Protein sequence
MARAARLYAT LLFGRMGRIG RWLRRHLLGT FLVLGLIALV LSLIGSYQHF SAEPETFTWA 
NVIFFASTLF LADGTMFENG GQFPPALEIA RFLAPLATAV GVADAASTLF AHRFERFRAR
HAHRHVIVCG TGPTASALVD KLVGNSRVVL VAEDAEREYP DAERPPNLLR VIGDPVERLV
LASAGIMRAD VVYSCLDDTA SNLAVALTAR GVVRTSRAGS RRISSARLDH PLRCLAQVGD
LSLLPHLRAR RIGLENDPGF RLDFFAVEVL GAHAMLNGNA PAWARPDEFP DLRARPAPVV
VLGLSDLGQA VVMELARRWR DYSSPGSPPL RIVLSGRNAT VEAAAMRSRE PALARVELLA
NDSPSGELPV AVLELDDPVE GARTVPPEFV YVCHGDEEEA LLRGLEVART LGAASLGEHG
TRVVVRTGRQ RSFEDVFGPR EAPDPDGPDD DPDAPVPPPP GPGRRGGGAL LDDVQGGLRF
FAVNDEALPL DPGANDLIER FARIIHEKYL FKEMTGGAVL RSRRSLRPWD ELDDDLRAAN
LAQAIGYSDV LRRRNWMLMP AGEHDPEFVF TPEELEELAQ AEHARWRRER ESRGVRYGPV
ERGGSDPRHP SLVDWEQLSP ADQDRDRDVV RNMPQVLATA GLRIVRMTPR PPD
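
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below builds the codon table from the compact NCBI-style amino acid string (base order T, C, A, G) and reads an initiating GTG, TTG or ATG as methionine. This is a simplified illustration with hypothetical names: it handles only these three start codons and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Simplification: only the most common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate_cds("GTGGCGCGGG CGGCGCGGCT CTACGCGACG"))  # MARAARLYAT
```

The output matches the first ten residues of the protein sequence. The same call on the full gene sequence can be compared with the full protein sequence.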