Gene Huta_2894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_2894 
Symbol 
ID8385203 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp2973723 
End bp2974943 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content62% 
IMG OID644973972 
ProductPyrrolo-quinoline quinone 
Protein accessionYP_003131788 
Protein GI257053955 
COG category[S] Function unknown 
COG ID[COG1520] FOG: WD40-like repeat 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0786698 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACAC GAAGAAGCTT CCTCCGACGA AGTGGGGTCG CCGTCGCAAC AATCGGCGTC 
GCCGGTTGTT CGAGTGTGTT CGGGGGGTCG GAGAATGAAA CTACTGATAC ACCAACCGAA
ACGGGGACCG AAACCTCGTG GAGGAACAGC GAGTCTTATA TCGCGTCGAT AAGACGGAAC
ATCCAGGGAA GACCGGTGGT CGCCGGCGAT CTGGTGTACG TCGTCGAAAC GCACGGCGAT
CTGTACGCGC TCGACCGCGA GAGCGGATCC GTCGAGTGGC GGTTCGAAAT GGACGGAACG
GTGCAAGCCC CGACCGTCAC AGGCGACGTC GTCTACACCG GCGATTACGG TCCGGGGGCG
ACCTCCAACG AGATCCGGGA GAGTAACGGA ACCGCATACG CAGTCGACCG TCACAGCGGG
GAGGCCATCT GGGAGACCGA CGTGAGCGGC ATGCCGCTCA ACACGCCGGT GGTGCGAGAC
GACGTTGTGC ACTACGCGGC ACTTGACGAT GGCGTCCACA CGCTCGCGAC CGAGGACGGC
GCTCAGAACT GGTCGCACAC GTTCGATCAA GGCTCTCTTT TCATCACGGA ACCGGTCGTG
ATGGACGGCG CCCTCTTCGC CACCAATCGA CGAGGGGTGT TCGCGCTCGA CCTGGCCGCC
CAGGAGACCT ACTGGACGAG CGATCCGATC GAGTATACAA ATGACCCGCT AGCGACCGAT
AGCGACTGGC TGTACGTGGC GGCCACAACC GGCGTCAAGG CACTCTCGTT CGATACCGGG
GACGTCGAGT GGACCGGCGA GAGCAACGGC ACCCCCCAAG ACATCGCCGT CGCCGACCGG
GTGTATACTT GCACGTCTAA CCCGTCCGAG GTCGTGGCGT TCGACCCGGC CATGGGGTCG
AAAGCCTGGG GAGAAAGCAT CACTGGTGAT CCTGCAGGCC TTCTTCTCGA CGATGGAACC
CTCTACGTCG CCACGTACGA CTATACGTTC AATACCGGGC GGCTCCACGT GTTGGACACC
GAGGAGGAGA CGCTGGCGTC TGAAATCTCC TTCACCGTCG AGAGCGAAGG CGAATTAGAA
ATGGGTTCCC CACCGGCAGT CGAAGACGGC CTGTGTTACG TCGGCGACGC CGCTGGAAAC
GTCTATGCGA TCGACATCGA CGACGAATCG GTTCACTGGC AGACGACGCC GCATGAAGCC
ACCCCGACCG GCGATTCGTA G
 
Protein sequence
MATRRSFLRR SGVAVATIGV AGCSSVFGGS ENETTDTPTE TGTETSWRNS ESYIASIRRN 
IQGRPVVAGD LVYVVETHGD LYALDRESGS VEWRFEMDGT VQAPTVTGDV VYTGDYGPGA
TSNEIRESNG TAYAVDRHSG EAIWETDVSG MPLNTPVVRD DVVHYAALDD GVHTLATEDG
AQNWSHTFDQ GSLFITEPVV MDGALFATNR RGVFALDLAA QETYWTSDPI EYTNDPLATD
SDWLYVAATT GVKALSFDTG DVEWTGESNG TPQDIAVADR VYTCTSNPSE VVAFDPAMGS
KAWGESITGD PAGLLLDDGT LYVATYDYTF NTGRLHVLDT EEETLASEIS FTVESEGELE
MGSPPAVEDG LCYVGDAAGN VYAIDIDDES VHWQTTPHEA TPTGDS