Gene Apar_0806 details


Gene Information

Locus tag: Apar_0806
Symbol:
ID: 8413671
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 889185
End bp: 890429
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 45%
IMG OID: 645022388
Product: thiamine biosynthesis/tRNA modification protein ThiI
Protein accession: YP_003179826
Protein GI: 257784609
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0301] Thiamine biosynthesis ATP pyrophosphatase
TIGRFAM ID: [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI
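
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the reported gene length includes the terminal stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(889185, 890429)
print(length)                     # 1245
print(protein_length_aa(length))  # 414
```

Both values agree with the Gene Length and Protein Length fields of this record.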


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.305799
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGAAA GAGTATGTCT GGTTCACTAC CATGAAATTG GTTTAAAGGG TAAGAACCGA 
TCCACTTTTG AAAATCAGTT GGTAACAAAT TTACGACGAG CCCTCCGACA CTTTCCAGTG
GCGGGGGTTT CTCGCATTTC TGGCTACCTT CTTGTAGAAA CCACCGATAA ACAAGCAACT
GAAGAACTGC AACGTGCAGT AGCTCAGGTT CCTGGTGTTG CTCGCGCTTC TTTAGCATAT
CGTTGTGGCT TGGATGAGCA GGAGTTTTGT AACGCCGCTG TGAAGGCGCT GGGGGAGGCT
GGCAACTTTA CTACGTTTAA AGTGCATGCT CGCCGTTCCT CAACAGTCTA TCCAAGGCAT
TCCATTGAGC TTAATCAACT TGTTGGTGCA GTACTTTGTG AGAATTTTCC TGATAAAAAA
GTTCAGATGC ATGATCCTGA TATGACGGTG TATGTACATG TTGTTCAGGG CAACGCTTAC
GTATATGCTG CGTCTATTCG CGGTGTTGGG GGTCTTCCTG TTGGAACTGC AGGTAAGGTT
GTATCACTTC TTTCAAGTGG TATAGATTCG CCTGTTGCAA CATGGATGGT TGGTCGTAGA
GGTGCTACGG TAATTCCAGT TCACTTCTCT GGTCGTCCTA TGACTTCTGA TACAAGCGAA
TACCTGTGTC AGGATATTAT TGAGGCACTT GAGTCCGCTG GCCTTATTGG CAGAATGTAT
GTTGTGCCAT TTGGTGAGCA TCAGCGTGAG ATTTCCCTTG CGGTTCCACA AGCTCTCCGT
ATCATTATGT ATCGTCGCGT GATGTATATC ATTGCACAGA AGATTGCAGA GCTTGAGGGT
GCTAAAGCAC TGGTAACGGG AGAGTCACTT GGTCAAGTTG CTTCTCAGAC TCTCGACAAC
ATTGCTGCTG TCAACGAGGC TGTTACCATA CCTGTTTTGA GACCTCTGAT TGGCTCTGAT
AAGCAAGAAA TTATTCGTCG CGCCCAAGAT ATCAATACAT TTGATATTTC AACGCAGACG
GCTCCTGATT GCTGCACACT GTTTATGCCT CGTCGTCCTG AGACTCATGC AAGGATACGA
GAAGTTCTTG AGGCATGGAA TTCTTTTAAT CATGATGAAA TGATTGAAAA TCTTATGCAG
CATATTGAGT ATATTGACTT TAACCAGTGT CCTTCATATA AGCCACCTAA AGTATTGCCT
GCTCGTCATA AAGAGCTTGC TCCTGCTGAA TTTGTTACTG ATTAG
 
Protein sequence
MTERVCLVHY HEIGLKGKNR STFENQLVTN LRRALRHFPV AGVSRISGYL LVETTDKQAT 
EELQRAVAQV PGVARASLAY RCGLDEQEFC NAAVKALGEA GNFTTFKVHA RRSSTVYPRH
SIELNQLVGA VLCENFPDKK VQMHDPDMTV YVHVVQGNAY VYAASIRGVG GLPVGTAGKV
VSLLSSGIDS PVATWMVGRR GATVIPVHFS GRPMTSDTSE YLCQDIIEAL ESAGLIGRMY
VVPFGEHQRE ISLAVPQALR IIMYRRVMYI IAQKIAELEG AKALVTGESL GQVASQTLDN
IAAVNEAVTI PVLRPLIGSD KQEIIRRAQD INTFDISTQT APDCCTLFMP RRPETHARIR
EVLEAWNSFN HDEMIENLMQ HIEYIDFNQC PSYKPPKVLP ARHKELAPAE FVTD
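
As a rough check of how the protein sequence relates to the gene sequence, the first 60 bases of the gene can be translated codon by codon. This is a minimal sketch, not the method used to produce this record: the `translate` helper and the table construction are hypothetical names introduced here. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code for internal codons (the tables differ in permitted start codons), so the standard assignments are used below.

```python
# Standard codon assignments, enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_line = "ATGACTGAAA GAGTATGTCT GGTTCACTAC CATGAAATTG GTTTAAAGGG TAAGAACCGA"
print(translate(first_line))  # MTERVCLVHYHEIGLKGKNR
```

The result matches the first 20 residues of the protein sequence listed above (MTERVCLVHY HEIGLKGKNR).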