Gene Acry_2089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_2089 
Symbol 
ID5161688 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp2301634 
End bp2303004 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content73% 
IMG OID640554011 
ProductOmpA/MotB domain-containing protein 
Protein accessionYP_001235207 
Protein GI148261080 
COG category[N] Cell motility
[S] Function unknown 
COG ID[COG1360] Flagellar motor protein
[COG3455] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03349] type IV / VI secretion system protein, DotU family
[TIGR03350] type VI secretion system OmpA/MotB family protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.975848 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACA ATCCGTTTGC CGAACCAGAC GATTCCGACC GCACGATCAT CAGGCCGGCG 
CCGGGCGGCC GGCGTCCCCC GCCCGCGCCG CCGCCGCCGC CGCCAACCGG CGGTGGCGAC
CCTTGGAGCG AGCCGCCGCG GCCGGCATCC CCGCCGCCCG CGGGCGGCGC GGAGACGCTG
AATATCGGCT CGACGCCGCT GATGGCGGCC GCGGCCCCCC TGCTGCAGTT GCTCGCTCGC
TTGCGCAACA CCCTGGCCCA GCCCGATTCG GGCGATCTGC GCGAACGCAC CGCCCGCGCG
TTGCGCGACT TCGAGCAGGC GGGCCGCAAT GCGGGCATTC CGAACGACCA GCTGCGCCCC
GCGCATTACG CCCTGTGCGC CAGCCTCGAT GACGTGGTGC TCGCCACCCC CTGGGGCAGC
AGCGGCGCCT GGGCCGCGCG CTCGATGGTG TCCACCTTCC ATCAGGAGGT CCGCTCCGGC
GAGCGCTTCT TCGACATCCT CAAGCAGATC ATGCAGAACC CGGGCCGGTT CCTGCCGGTT
CTCGAGCTGA TGTATATCTG CCTCTCGCTC GGCTACATGG GGCGCTACCG GCTCAGTCCG
CGCGGCCCCG CCGAGATCGA CCGGCTGCGC GAGGATGTCT ATGCCGTCCT CCGCCGCGCC
CGCCCGGCCG CCAGCCCCGA ACTGGCGCCC CACTGGCAGG GCGTTGCGGC GCCCTACCGC
CCGCGCCGCC CGTCCCTGCC GGTCTGGGTC GCCGCGGTGG CGGCCGCCGG GGTGCTGGCC
CTTGTCTACG CGGCCTTCGA CTACGGGCTC GGCGGCCAGT CCGCCACGCT CTACGCGCAG
TCGGTCGCCG CCCACCCGGC GCGGATGCCG AAGATCGTCC GCGCCGCCGC CGTCGTGCCG
CCGCCGCCGC CGGTCACCAC CGGCCCGAAC GTGCTCGACC GGCTGCGCGG CTTCCTGCAA
CCCGAAATCA CCAAGGGCGA GGTCGCCGTG CTCGGCACCG TCAACGCGCC GGTCATCCGC
ATCAACAACA CCGGCCTGTT CGCCTCCGGC AGCGCGACGG TCGAGAGCAC CGCGCTGCCG
CTGATTTCCA AGATCGGCCA GGCGCTGGCG CGCGAGAAGG GCAAGGTGCA GGTGATCGGC
TATACCGACA GCCAGCCGAT CCACACGCTG CGCTTCCCCA ACAACCTGGT CCTCTCGGAG
GACCGCGCGA AGGCGGCGGC CGCCGTGCTC GACCGCGCGA TCGGCGATCA GAGCCGCATC
ACCGCGGAGG GGCGCGGCGC CGCCGACCCG ATCGCGACCA ACGCGACGCC GCAGGGGCGC
GCCCTGAACC GGCGGATCGA AATCGTGCTG ATCCGGAGTG AAACCCAATG A
 
Protein sequence
MSDNPFAEPD DSDRTIIRPA PGGRRPPPAP PPPPPTGGGD PWSEPPRPAS PPPAGGAETL 
NIGSTPLMAA AAPLLQLLAR LRNTLAQPDS GDLRERTARA LRDFEQAGRN AGIPNDQLRP
AHYALCASLD DVVLATPWGS SGAWAARSMV STFHQEVRSG ERFFDILKQI MQNPGRFLPV
LELMYICLSL GYMGRYRLSP RGPAEIDRLR EDVYAVLRRA RPAASPELAP HWQGVAAPYR
PRRPSLPVWV AAVAAAGVLA LVYAAFDYGL GGQSATLYAQ SVAAHPARMP KIVRAAAVVP
PPPPVTTGPN VLDRLRGFLQ PEITKGEVAV LGTVNAPVIR INNTGLFASG SATVESTALP
LISKIGQALA REKGKVQVIG YTDSQPIHTL RFPNNLVLSE DRAKAAAAVL DRAIGDQSRI
TAEGRGAADP IATNATPQGR ALNRRIEIVL IRSETQ