Gene Franean1_3715 details


Gene Information

Locus tag: Franean1_3715
Symbol:
ID: 5672081
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4397016
End bp: 4398722
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 69%
IMG OID: 641242598
Product: thiamine pyrophosphate binding domain-containing protein
Protein accession: YP_001508018
Protein GI: 158315510
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
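
The coordinate and length fields in this record are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4397016, 4398722)
print(length)                  # 1707
print(protein_length(length))  # 568
```

Both results match the Gene Length and Protein Length fields listed in the record.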


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGTAA AAGTCTACGA GCGCATCCTC CAGTTGTTCG AAGCCGAGGG CATCAAGACG 
ATCTTCGGTA TTCCCGACCC GAACTTCGTG CACATGTTCC ACCTCGCCGA GGAACGCGGC
TGGACCGTGG TCTCACCGCA CCATGAGGAG TCCGCCGGGT TCATGGCGGA GGCGGTGTCC
CGGATGACCG GCAAGGCCGC GGTGGCGATC GGCACCCTGG GCCCGGGTGT CGCGAACCTG
GCCGGGGCGA TGATGTGCGC CAAGGTCGAG AACTCGCCGG TCATCTTCCT CGGCGGGCAG
CGTGCCCGGA TCACCGAGCA GCGGGTGCGG CGCGGCCGGA TCCAGTTCGT GCGGCAGGCG
GGCCTGTTCG AGCCGTCGGT GAAGTACTGC GCCAGCATCG AGTACGCCGA CCAGACCGAC
GAGGTCATCC GCGAGGGCCT GCGCAAGGCC CTGTCCGGCA CCCCGGGCCC GGTCTACATC
GAGTACCCCT CCCACATCAT CCAGGAGGAG CTCGACGTTC CCCCGCCGCT GCCGCCGGCG
GCGTACCGGC TGGTGAACCA GACCGCCGGC CCCGACAGGA TCGCCGAAGC CGTGCAGTAC
ATCCGCGCGG CCAAGCAGCC GGTCCTGCTC GTCGGGCACG GTGTGCACAC CTCCCGCGCG
GGAGAGTCCG TCCGCGCCCT CGCGGAGGCG ATGGCCTGCC CGGTCATCCA GACCTCCGGC
GGCACCTCCT TCATCAAGGG CCTGGAGGAC CGCACCTTCC CCTACGGCTT CTCCGCGGCG
GCCGTGGAGG CCGTCGTGGA GTCAGATCTC TGTCTCGCCA TCGGCACCGA AATCGGGGAG
CCGGTCCACT ACGGCCGCGG CCGGCACTGG GTCGCGAACG AGGCCAACCG CAAGTGGATC
CTCATCGAGC AGGACCCCGA GGCCATCGGG GTGAACCGGT CGATCGACGT GCCCCTCGTC
GGTGACCTGC GCGCGGTCGT CCCGCAGCTC GTCGACGCCC TCAAGGACAC CCCGCGCACG
CCGACCCCCC GGCTTGAGGC CTGGGTGACG CAGGAGGCAG CCCGGTTGGC GGAACTGGCG
GAGACGGCCC TATCCGGCAT GTCACCCGTG CACCCTGCAC GGCTCATCGT CGAGGCCACC
AAGGCCTTTC CCGCCGACGG CATCATGGTG CGCGACGGCG GCGCGATCAC CATTTTCGGC
TGGACGTACT CGCAGGCCAA GCCGCACGAC GTGATCTGGA ACCAGAACTT CGGCCACCTC
GGGACCGGGC TCCCCTACGC CGTCGGCGCC TCGGTGGCGG ATGGCGGGAA ACGGCCCGTC
ATGCTCATCA CCGGGGACTC GTCCTTCCAG TTCCACATCG CCGAGCTGGA GACCGCCGCC
CGACTGAACC TCCCGCTGGT CTGCGTGGTC AGCGTGGACT ACGCCTGGGG CCTCGAGGTC
GGCGTCTACA AGCGCACTTT CGGCCAGGGC TCGCTGGAGA CCGGCGTCCA CTGGAGTGAA
GACGTCCGGC TCAACAAGGT CGCCGAGGGC TTCGGCTGTT ACGGCGAGTA CGTCGAGCGC
GACGAGGACA TCGCCCCCGC CATCAAGCGC GCCTACGCCA GCGGAAAGAC CGCCGTCATC
CACGTCGCCG TCGACCCGAA GGCCAACTCG GAGGAAATGC CGAACTACGA CGAGTTCCGG
ACCTGGTACG CAGAGGGAAT GCAGTAG
 
Protein sequence
MPVKVYERIL QLFEAEGIKT IFGIPDPNFV HMFHLAEERG WTVVSPHHEE SAGFMAEAVS 
RMTGKAAVAI GTLGPGVANL AGAMMCAKVE NSPVIFLGGQ RARITEQRVR RGRIQFVRQA
GLFEPSVKYC ASIEYADQTD EVIREGLRKA LSGTPGPVYI EYPSHIIQEE LDVPPPLPPA
AYRLVNQTAG PDRIAEAVQY IRAAKQPVLL VGHGVHTSRA GESVRALAEA MACPVIQTSG
GTSFIKGLED RTFPYGFSAA AVEAVVESDL CLAIGTEIGE PVHYGRGRHW VANEANRKWI
LIEQDPEAIG VNRSIDVPLV GDLRAVVPQL VDALKDTPRT PTPRLEAWVT QEAARLAELA
ETALSGMSPV HPARLIVEAT KAFPADGIMV RDGGAITIFG WTYSQAKPHD VIWNQNFGHL
GTGLPYAVGA SVADGGKRPV MLITGDSSFQ FHIAELETAA RLNLPLVCVV SVDYAWGLEV
GVYKRTFGQG SLETGVHWSE DVRLNKVAEG FGCYGEYVER DEDIAPAIKR AYASGKTAVI
HVAVDPKANS EEMPNYDEFR TWYAEGMQ
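
The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before any analysis. The sketch below shows one way to do that and to compute a GC fraction. It assumes GC content means the share of G and C bases over the full gene sequence; the record does not state how its 69% figure was derived, and the helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGCCGGTAA AAGTCTACGA GCGCATCCTC CAGTTGTTCG AAGCCGAGGG CATCAAGACG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Passing the whole gene sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field.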