Gene Franean1_5609 details


Gene Information

Locus tag: Franean1_5609
Symbol:
ID: 5675768
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6805135
End bp: 6806955
Gene Length: 1821 bp
Protein Length: 606 aa
Translation table: 11
GC content: 71%
IMG OID: 641244462
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_001509866
Protein GI: 158317358
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
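
The length fields in this record are consistent with one another if the coordinates are read as 1-based and inclusive: End bp minus Start bp plus 1 gives the gene length, and one third of that, less the stop codon, gives the protein length. A minimal Python sketch of that check follows. The inclusive-coordinate convention is an assumption, and the function names are illustrative only, not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(6805135, 6806955)
print(length)                  # 1821
print(protein_length(length))  # 606
```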


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.567176
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000155826
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGGTGCGTC AGATCCGCCC GACGGATTCC GACAACGCGG CGGCCGCGGC CTCCGGCACG 
GACGACGGCG CGGAGGGCAT CCTGGCCCGG CCGTTCCCCG GGAGCAGGAA GACCTACCTG
GTCGGGTCCC GGCCCGACCT GCGGGTGCCG ATGCGCGAGG TCGCGCTGAG CACGGGCGAC
AGCCTGGTCC TCTACGACAC CTCCGGGCCG TACACCGACC CGGGCGCCGG CGTCGACGTG
CGGCGCGGCC TGCCCGCGCT GCGGGCCGGC TGGATCGCCG CCCGCGGCGA CACCGAGGAG
ATCGAGGGGC GTGCCGTCCA GCCCGCCGAC GACGGGCGGC GCGGGCCTGA CGGGCAGGCC
GGTGACACCT TTCTCGGCTC GGGGCAGGCT CCGCGGCGGG CTACCGCCGG CCGCGCGGTG
ACCCAGCTCG CCTACGCCCG GCGGGGCGAG ATCACCGCCG AGATGGAGTT CGTCGCGCTG
CGTGAGGGCC TCTCTCCCGA GCTGGTGCGC GACGAGATCG CCGCGGGCCG GGCGGTGCTG
CCGGCGAACG TGAACCACCC CGAGAGCGAG CCGATGGCCA TCGGCCGCAA CCTGCTGGTG
AAGATCAACG CGAACATCGG CAACTCCGCC GTCGCCTCCT CCATCGAGGA GGAGGTCGAG
AAGATGGTGT GGTCGACCCG CTGGGGCGCC GACACCGTCA TGGACCTCTC GACCGGGCGG
AACATCCACA CCACCCGCGA GTGGATCATC CGGAACTCCC CCGTCCCGAT CGGGACGGTG
CCGATCTACC AGGCGCTGGA GAAGGTCGGC GGCAAGGCCG AGGAGCTGAC CTGGGAGGTC
TACCGGGACA CGGTGATCGA GCAGGCCGAG CAGGGTGTGG ACTACATGAC CGTCCACGCC
GGGGTGCTGC TGCGCTACGT GCCGATGACC GCGCGGCGCA AGACCGGCAT CGTCTCGCGC
GGCGGCTCGA TCCTGGCGGC ATGGTGCCTC GCCCATCATG AGGAGAACTT CCTCTACACG
AACTTCGCCG AGCTCTGCGA GATCCTGCGG GCCTACGACG TCACCTTCTC CCTCGGCGAC
GGTCTGCGCC CCGGCTCCAT CTCCGACGCG AACGACGCCG CGCAGCTCGC CGAGCTCGCA
ACACTGGGCG AGCTGACCTC GGTGGCCTGG GAGCACGACG TCCAGGTGAT GATCGAGGGG
CCGGGCCACG TGCCCATGCA CAAGATCAAG GAGAACGTGG ACCTGCAGCG CGAGCTGTGC
CACGACGCGC CGTTCTACAC CCTCGGCCCG CTCACCACCG ACATCGCGCC CGGCTACGAC
CACATCACCT CGGCCATCGG CGCCGCGATG ATCGGCTGGT ACGGGACGGC GATGCTCTGC
TACGTCACGC CCAAGGAGCA CCTGGGCCTG CCGGACCGCG AGGACGTCAA GGCCGGGGTC
ATCGCCTACA AGATCGCGGC GCACGCCGCC GACCTGGCGA AGGGGCACCC GGGCTCCCAG
GCCTGGGACG ACGCCCTCTC CGACGCGCGG TTCGACTTCC GCTGGGACGA CCAGTTCAAC
CTCTCACTCG ACCCGGAGAC CGCCCGGGAG TACCACGACG AGACACTGCC CGCGGCGCCC
GCCAAGTCCG CGCACTTCTG CTCGATGTGC GGCCCGCACT TCTGCTCGAT GCGGATCACG
CAGGACGTCC GCAAGTACGC GGCCGACCAC GGCCTCGACA CCGACGAGGC CGTCCAGGCC
GGATTGGCGG AGAAGTCCCG GGAGTTCGCC GAGCAGGGTG CCCGGATCTA CCTGCCGCTC
GCCGACCGGC AGACCCCCTG A
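
The GC content field in the Gene Information section is the share of G and C bases in the gene sequence above. A hedged Python sketch of that calculation follows. It assumes the whitespace between ten-base blocks should be ignored, and gc_percent is a hypothetical helper rather than anything provided by the source database. Only the first three blocks are used here, so the printed value is for that fragment and not for the whole gene.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First three ten-base blocks of the gene sequence, copied as displayed.
fragment = "GTGGTGCGTC AGATCCGCCC GACGGATTCC"
print(round(gc_percent(fragment), 1))
```

Passing the full 1821 bp sequence to the same function gives the value to compare against the reported GC content.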
 
Protein sequence
MVRQIRPTDS DNAAAAASGT DDGAEGILAR PFPGSRKTYL VGSRPDLRVP MREVALSTGD 
SLVLYDTSGP YTDPGAGVDV RRGLPALRAG WIAARGDTEE IEGRAVQPAD DGRRGPDGQA
GDTFLGSGQA PRRATAGRAV TQLAYARRGE ITAEMEFVAL REGLSPELVR DEIAAGRAVL
PANVNHPESE PMAIGRNLLV KINANIGNSA VASSIEEEVE KMVWSTRWGA DTVMDLSTGR
NIHTTREWII RNSPVPIGTV PIYQALEKVG GKAEELTWEV YRDTVIEQAE QGVDYMTVHA
GVLLRYVPMT ARRKTGIVSR GGSILAAWCL AHHEENFLYT NFAELCEILR AYDVTFSLGD
GLRPGSISDA NDAAQLAELA TLGELTSVAW EHDVQVMIEG PGHVPMHKIK ENVDLQRELC
HDAPFYTLGP LTTDIAPGYD HITSAIGAAM IGWYGTAMLC YVTPKEHLGL PDREDVKAGV
IAYKIAAHAA DLAKGHPGSQ AWDDALSDAR FDFRWDDQFN LSLDPETARE YHDETLPAAP
AKSAHFCSMC GPHFCSMRIT QDVRKYAADH GLDTDEAVQA GLAEKSREFA EQGARIYLPL
ADRQTP
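
The protein sequence is the gene sequence translated with translation table 11. The gene begins with GTG, which table 11 permits as an initiation codon and which is read as methionine in the first position, so the protein starts with M rather than V. A small self-contained Python sketch follows. The codon table is built by hand instead of being taken from a library, and the translate helper is illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Table 11 uses the standard
# amino-acid assignments and differs in which codons may initiate translation.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last blocks of the gene sequence, copied as displayed.
print(translate("GTGGTGCGTC AGATCCGCCC GACGGATTCC"))  # MVRQIRPTDS
print(translate("GCCGACCGGC AGACCCCCTG A"))           # ADRQTP
```

The two printed fragments match the first ten and last six residues of the protein sequence shown above.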