Gene Franean1_3698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3698 
Symbol 
ID5672064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4378183 
End bp4379889 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content70% 
IMG OID641242581 
Productthiamine pyrophosphate binding domain-containing protein 
Protein accessionYP_001508001 
Protein GI158315493 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTAA AAGTCTACGA GCGCATCCTC CAGTTGTTCG AAGCCGAGGG CATCAAGACG 
ATCTTCGGTA TTCCCGACCC GAACTTCGTG CACATGTTCC ACCTCGCCGA GGAACGCGGC
TGGACCGTGG TCTCACCGCA CCATGAGGAG TCCGCCGGGT TCATGGCGGA GGCGGTGTCC
CGGATGACCG GCAAGGCCGC GGTGGCGATC GGCACCCTGG GCCCGGGTGT CGCGAACCTG
GCCGGGGCGA TGATGTGCGC CAAGGTCGAG AACTCGCCGG TCATCTTCCT CGGCGGGCAG
CGTGCCCGGA TCACCGAGCA GCGGGTGCGG CGCGGCCGGA TCCAGTTCGT GCGGCAGGCG
GGCCTGTTCG AGCCGTCGGT GAAGTACTGC GCCAGCATCG AGTACGCCGA CCAGACCGAC
GAGGTCATCC GCGAGGGCCT GCGCAAGGCC CTGTCCGGCA CCCCGGGCCC GGTCTACATC
GAGTACCCCT CCCACATCAT CCAGGAGGAG CTCGACGTTC CCCCGCCGCT GCCGCCGGCG
GCGTACCGGC TGGTGAACCA GACCGCCGGC CCCGACAGGA TCGCCGAAGC CGTGCAGTAC
ATCCGCGCGG CCAAGCAGCC GGTCCTGCTC GTCGGGCACG GTGTGCACAC CTCCCGCGCG
GGAGAGTCCG TCCGCGCCCT CGCGGAGGCG ATGGCCTGCC CGGTCATCCA GACCTCCGGC
GGCACCTCCT TCATCAAGGG CCTGGAGGAC CGCACCTTCC CCTACGGCTT CTCCGCGGCG
TCGATCGACG CGGTGGTCAA GTCCGACCTG TGTCTGGCGA TCGGCACCGA GCTCGGGGAG
CCGGTCCACT ACGGCCGCGG CCGGCACTGG GTCGCGAACG AGGCCAACCG CAAGTGGATC
CTCATCGAGC AGGACCCCGA GGCCATCGGG GTGAACCGGT CGATCGACGT GCCCCTCGTC
GGTGACCTGC GCGCGGTCGT CCCGCAGCTC GTCGACGCCC TCAAGGACAC CCCGCGCACG
CCGACCCCCG AGCTCGACGG CTGGATCCGC CAGGACGCCG CCCAGCTGGC GGAGCTCGCG
GAGACTGCCC CGGCGGGCAT GTCACCCGTG CACCCGGCGC GCCTGATCGT CGAGGCCACC
AAGGTCTTCC CGCCGGACGG CATCATGGTG CGCGACGGCG GCGCCACCAC GATCTTCGGC
TGGACGTACT CACAGGCCAA GCCGCACGAC GTGATGTGGA ACCAGAACTT CGGCCACCTG
GGCACCGGGC TGCCCTACGC GGTCGGCGCC GGGGTGGCGG ACGGCGGCAA GCGCCCGATG
ATGCTGATCA CCGGGGACTC GTCCTTCCAG TTCCACATCG CCGAGCTGGA GACCGCCGCC
CGGCTGAACC TCCCGCTGGT CTGCGTCGTG GCCGTCGACT ACGCCTGGGG CCTCGAGGTC
GGCGTCTACA AGCGCACCTT CGGCCAGGGC TCGCTGGAGA CCGGCGTCCA CTGGAGCAAG
AACACCCGCC TGGACAAGGT CGCCGAGGGC TTCGGCTGTT ACGGCGAGTA CGTCGAGCGC
GACGAGGACA TCGCCCCCGC CATCAAGCGC GCCTACGCCA GCGGAAAGAC CGCCGTCATC
CACGTCGCGG TCGACCCGAA GGCCAACTCG GAGGAAATGC CGAGCTACGA CGAGTTCCGG
ACCTGGTACG CCGAGGGAAC GCAGTGA
 
Protein sequence
MAVKVYERIL QLFEAEGIKT IFGIPDPNFV HMFHLAEERG WTVVSPHHEE SAGFMAEAVS 
RMTGKAAVAI GTLGPGVANL AGAMMCAKVE NSPVIFLGGQ RARITEQRVR RGRIQFVRQA
GLFEPSVKYC ASIEYADQTD EVIREGLRKA LSGTPGPVYI EYPSHIIQEE LDVPPPLPPA
AYRLVNQTAG PDRIAEAVQY IRAAKQPVLL VGHGVHTSRA GESVRALAEA MACPVIQTSG
GTSFIKGLED RTFPYGFSAA SIDAVVKSDL CLAIGTELGE PVHYGRGRHW VANEANRKWI
LIEQDPEAIG VNRSIDVPLV GDLRAVVPQL VDALKDTPRT PTPELDGWIR QDAAQLAELA
ETAPAGMSPV HPARLIVEAT KVFPPDGIMV RDGGATTIFG WTYSQAKPHD VMWNQNFGHL
GTGLPYAVGA GVADGGKRPM MLITGDSSFQ FHIAELETAA RLNLPLVCVV AVDYAWGLEV
GVYKRTFGQG SLETGVHWSK NTRLDKVAEG FGCYGEYVER DEDIAPAIKR AYASGKTAVI
HVAVDPKANS EEMPSYDEFR TWYAEGTQ