Gene Haur_4812 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4812 
Symbol 
ID5736657 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6139949 
End bp6141037 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content50% 
IMG OID641281977 
Producthypothetical protein 
Protein accessionYP_001547570 
Protein GI159901323 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0823] Periplasmic component of the Tol biopolymer transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCAGA TAGGTCGTCG AACAGTAACC AGTTTATCAG TAGGCTTGGT AGCGATCATA 
ATTTTGTTGG TCACGATTGG TCAACAACCC AGTGCAGGCC AAGTTGCTAA CAATACGCCA
AATCAAGCTC AAGCCCAAGT TAGTGGTATT TTATTTTTGC GCAATCCCGC TCAGCAAGCT
GAATTGTGGC GCTCGGATGC CAATGGTCAA GGGCAGCAAT TACTTGTACC GCAAGTTAGC
GATTATAGCC TTAGCCCCGA TGGGCGCAAA GTTGCCTATG CAACTCAAGC CGAAGCCCAA
CCAAGCCGGA TCGAAATGTT TGATCTGACC CAAAATCAGG TGATTACGAG CACAGGTTCG
GCTGATTGGA CTGGTTACAC GCCGAATTGG TCGCCTGCTG ATGGCGTGAT TGTCTATGAA
CGACGCACAA TTAGCACTGG CGGAGTTGGT TCGCCCAAAC TTTGGTTGAT GCAGCCCGAT
GGAACACAAG TTAGTCCGGT GGTTAAAGGC GGCGATGTGG TTACCTTTGG TGCACATTGG
TCGAATACTG GACGCTTGTT AGGCTTTACC GATCCATTGC GCAATGAATT GGTTTTATTT
GATTTTAGCG ATGTGTTGCG GCGGATTCCA TTTAGCGGCG ATTTTGATTG GTCACCCGAT
GATCAGCGTT TAGTAATTAG TGTGTTGCGG GAGTCGCAAG CAGGTTTTCG CAACGAATTA
ATCCTATTTG ATCTGATGAC CGAGCAACAA ACACCCTTGA CTAGCCAAAC CGACACTGAT
GATTTCACGC CCGTTTGGTC GCCTGATGGC ACAAAAATCG CCTTTGTGCG CCGCACCCGT
GAAGTGCCTC GCGGCGAAAT TTGGGTGGTT AATGCTGATG GCAGCGAGCC ACGGGCAATT
ACGGCGGGCG GCGGCTACGA TAACGTTGAT CCGCAATGGA CTCCCGATAG CCAACAATTG
CTCTGGACGC GCTTGACCGT GGGTTCGGCA AACGTACCCT CGGCAATCTG GACGGTTAAT
TTGGCTGAAA ATTCAGAGCC ACGGGTGTTG ATCGAAAATG CCACCCAAGC TCGTTGGATC
GTTGAGTAG
 
Protein sequence
MDQIGRRTVT SLSVGLVAII ILLVTIGQQP SAGQVANNTP NQAQAQVSGI LFLRNPAQQA 
ELWRSDANGQ GQQLLVPQVS DYSLSPDGRK VAYATQAEAQ PSRIEMFDLT QNQVITSTGS
ADWTGYTPNW SPADGVIVYE RRTISTGGVG SPKLWLMQPD GTQVSPVVKG GDVVTFGAHW
SNTGRLLGFT DPLRNELVLF DFSDVLRRIP FSGDFDWSPD DQRLVISVLR ESQAGFRNEL
ILFDLMTEQQ TPLTSQTDTD DFTPVWSPDG TKIAFVRRTR EVPRGEIWVV NADGSEPRAI
TAGGGYDNVD PQWTPDSQQL LWTRLTVGSA NVPSAIWTVN LAENSEPRVL IENATQARWI
VE