Gene Hhal_1902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1902 
Symbol 
ID4710676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2093831 
End bp2095954 
Gene Length2124 bp 
Protein Length707 aa 
Translation table11 
GC content69% 
IMG OID639856375 
Productputative PAS/PAC sensor protein 
Protein accessionYP_001003468 
Protein GI121998681 
COG category[L] Replication, recombination and repair 
COG ID[COG2176] DNA polymerase III, alpha subunit (gram-positive type) 
TIGRFAM ID[TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family
[TIGR01406] DNA polymerase III, epsilon subunit, Proteobacterial 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAGCG CCATGGCACG TCGCCGGTTG AAATTCTGGG CACCGGCGCT GGTCCTGCTG 
CTGCTGATCG CCGCTGCCCT GGCCACCGTC GGCTACCTGA CCCTGGGACA CGTTCCCGAG
GGACCGGAGC GCGCCAATGC GCTGCTCGCC CTGGGAGGCG CCGGCGCGGT GCTCATCGGC
GCCACCGTGG CCATCTGGAT CCTCCTGGAC GCCACCGTCA TCCGCCCCCT GGGCGCGCTG
GCCCGCGGGG CGTCGATCAT GGCCCACTCG AACCCGGCCC ACGAACTGGA GATCCCGAGC
ATGCACCTGC TCGGCGAGCT CCCGGAGAGC CTACAGACGC TGGGCAGCAA CCTCTATGAG
ACCCGGCGCG AAGTGGCCAA GGCCCTGGGC TCCGGCGCCC AGGGGGTCGA GAACCAGAAG
ACGCGGCTGG AGATCGTTCT GCGCGAGATC CAGCAGGGGG TCATCGTCTG CGACACCGAG
GGCCGGGTGC TGCTCTACAA CCCGGCCGCC GGCGAGATCC TGCGCAGTGA TGCACTGGGT
CTGGGGCGCT CCATCTACGA CGTGCTGGCC CGCTCGGCGG TGGACCACAC CCTGGAGATG
CTCCAACACC GCCTGGCCAT CGCCGACGAC CACACCGTGG CGGAAAACCG CGCCGAATTC
GTCTGCGCGA CGGTGGATGA CGGCGCCCTG CTGCACTGCC GGATGAGCCT GCTCCCCTCC
TCCAGCCCCC TGCGCTCGGG GTTCGTGCTG ACCATCGAGG ACATCACCCG CAAGATTGAA
GGCGTAGCCC GCCGCGACCA CGCCCTGCGC ACCGCCGTGG AGGCCCTGCG CCCGCCGCTG
GCGAACCTGC GCGCCGCCGC CGAGAGCCTC GGCCAGGGCG ACGAGGCCAT GGCCCGCGAG
CAGCGCCAGT CGTTCGAGAC GCTCATGGTC CACGAAAGCC GGGAGCTGAG CCGACGCTTC
GAGCGCATGG CCCGGGAGAC CCACCGACTC GTCTCGGCCC CCTGGACCAT GGCGGACATC
AGCAGCGCGG ACCTATTGGC CAGCGTCCTG CGCCGCCACC CGGAGGGATT GCCGAAGGTC
GAGGTGGTCG GCATCCCGCT GTGGATGCAC GCCGAGAGCC ACTCCATCGG CCTGGTACTG
GAGCACATGC TGCGCCACCT GCGCGGCGAA CTCGGCATCG AGCAGATCCG CGCCGAGCCC
CTGATGGGGG ATCAGCGGGT GTATCTGGAT CTCTCCTGGC AGGGCCCACC GATCCCGCCG
GATCAGCTCG AGACGTGGCT GGACGAGGAG CTGCCCGAGG CCACCGGGCA ACTGACCCCG
CGCGGGGTGC TGGAGCGCCA CGACAGCGTC GCCTGGAGCC AGATACAACC TCGCTGCGAA
GGACGCGCCC TGCTGCGCAT CCCGGTGCCG CTATCGCGGC GCCAGTGGGA GCAGCCCGGC
GAGCGCCTGC CACCGCGGCC GGAGTTCTAC GACTTCTCGC TGGCCGACCA GGCCGCCGAT
CAGGGCGAGC TGCTCGATCG CCCCCTGGCC CAGCTATCAT TCGTCATCTT CGATACCGAG
ACCACCGGAC TGGCCCCCTC GGAAGGCGAC GAGATCATCT CCATCGCCGG GGTACGCATG
GTCAATGGCC GCATCCTCGA GGGCGAGTGC TTCGAGCAGC TGGTCAACCC CGGGCGGCCG
ATCCCCAAAG CGTCGATCAA GTTCCACGGC ATCCGCGACG AGATGGTCGC CGACAAGCCG
GGGATCGCCA CGGTGCTGCC GCAGTTCAGC GCCTTCGTCG GCGATTCCGT GCTGGTCGCC
CATAACGCCG CCTTCGACAT GAAGTTCATC CGCCTCAAGG AAGGTCAGTG CGGCCTGAAG
TTCGAAAACC CGGTGCTCGA CACCCTGCTG CTGTCGGTCT TCCTGCACGA TCACACCCCT
GAGCACACCC TGGAGGCCAT CGCCAACCGC CTGGGGGTGG AGATCAGCGG CCGCCACACG
GCGCTGGGCG ACACCCTGGT CACCGGCGAG ATCTTCGCGC AGATGCTGCC GCTGCTCGAG
GAGCGCGGCG TCACCACCCT GCGCGATGCG ATCAACGCCT CCGAACAGAT GGTCGAGGTC
CGCAAGCAGC AGGCCCAGTT CTAA
 
Protein sequence
MASAMARRRL KFWAPALVLL LLIAAALATV GYLTLGHVPE GPERANALLA LGGAGAVLIG 
ATVAIWILLD ATVIRPLGAL ARGASIMAHS NPAHELEIPS MHLLGELPES LQTLGSNLYE
TRREVAKALG SGAQGVENQK TRLEIVLREI QQGVIVCDTE GRVLLYNPAA GEILRSDALG
LGRSIYDVLA RSAVDHTLEM LQHRLAIADD HTVAENRAEF VCATVDDGAL LHCRMSLLPS
SSPLRSGFVL TIEDITRKIE GVARRDHALR TAVEALRPPL ANLRAAAESL GQGDEAMARE
QRQSFETLMV HESRELSRRF ERMARETHRL VSAPWTMADI SSADLLASVL RRHPEGLPKV
EVVGIPLWMH AESHSIGLVL EHMLRHLRGE LGIEQIRAEP LMGDQRVYLD LSWQGPPIPP
DQLETWLDEE LPEATGQLTP RGVLERHDSV AWSQIQPRCE GRALLRIPVP LSRRQWEQPG
ERLPPRPEFY DFSLADQAAD QGELLDRPLA QLSFVIFDTE TTGLAPSEGD EIISIAGVRM
VNGRILEGEC FEQLVNPGRP IPKASIKFHG IRDEMVADKP GIATVLPQFS AFVGDSVLVA
HNAAFDMKFI RLKEGQCGLK FENPVLDTLL LSVFLHDHTP EHTLEAIANR LGVEISGRHT
ALGDTLVTGE IFAQMLPLLE ERGVTTLRDA INASEQMVEV RKQQAQF