Gene Haur_3818 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3818 
Symbol 
ID5735682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4792284 
End bp4794206 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content55% 
IMG OID641280970 
Producthypothetical protein 
Protein accessionYP_001546582 
Protein GI159900335 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00672536 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGTTGG GTCGATATGC TCGGCCATTG CTGCTGAGTA GCGTGGTTGC TGGGCTATTG 
TTATGGCTTG GTTGGCAGCG TTTAGTGCCA TTCAACCTAG CAATTGGCGG CGATTTAGTC
ATTGGCGAGT CGGGTGAGCA ATTTGCCTTG ATGTACGATC AGCCCTATCT TGAGCATGTG
CATGCCCCCG AACCGGCCAC GGTTGATCTC ACCACCACTG AAACCTATCG CTGGACCCAA
CCCGAATCAG CGATAACTGT ACCGTATCTC AATGGCAGTG CTCATATCGT TCGCTTGAGT
TTAGCGCCGC CATCAGTGCC ACAAACCCCG TTTTTGCTGC AAGCCAATGG CGCAGCGGTT
CGTACCGACC TGCCACCAGG CCAACGCACG TTGCATCTGT TCGCTCCTGC TAGTAGCGAT
GGCAGTTTAA CCCTCGAATT ACAAGCCCCA ACCTACAATG CTAGCCCTGA CCCACGTTTG
CTGGGAGTGG TGTTGTATCG GATGCAGGCT CAACCGTTAA GCCACGACTG GTTTCTGCCA
TGGGCCGCTT GGATCGATTT GCTGCTCGTG GTAATTGTGG TTGGGCTTGG TGCTGCTTTG
GCGGGTTTGG CTCCGTTGAC TGCGGCTGGC GCGATCTTGG TGACCAGCAG CGGATTAAGT
GTTTTGTTGG CAACAGTGCG GACGGTGATT ACGCTCGATA CTGGACAGTT GCGCACAATA
AGCTTGGCTT GTTTGCTGGT GGCTTGGCTT GGGCGTTGGT TGGCTGAGCG CCGCCAAAAC
CCCGAAATCG CCTTAGTTGC TGGAATGACC GCCTTGGGTT TGGCGCTGCG CTTGATCGGC
ATTCGTCATC CCCAAACCAA CTTCAGTGAC CTCTTGTTGA ATGTTAATAA TCTGGCGAGT
GTGGGGAGCG GCGATTTACT ATTTACCGAA GGTTTGCCTT GTGCCGCAGG TGCTGGCCGT
TCGCCCTATC CTCCAGGCAC GTATCTCATC ACGCAACCGT TAAGTTTATT GCTGCCCGCC
AGCATGAATC GCGGCATTTT GATTCAGATA GTTGGTGCTG TTGCCGATGC GTTGGTGATT
CCGTTATTGT GGTGGCTGAT TGATCGCACC CGTGACGCGC GAACGCCAGC ACGGGCGGCG
TTATGGGCTG CCAGCCTCTA TCTTGCTCCC TTGGCGATGC TGCGGGCAAT GGTGATTGGT
GAATGGAGCA ATGTGCTGGG CCAGGCAATT GCCATGCCAA TCTTGGCGTG GCTGGGATTA
TGGCTTGCCA GCAACCAGCC GCGAGCATGG CAACCAGCCC TGATTGCTGG CTTGACGATC
GCCGCCTTAC AGCATAGTGG CACAATGTTA TCGCTCGGGT TGTGGGGCGT GGCCTTGGCG
GGATTCTTGG TTTGGCAAAA ACAATGGCAA GTGCTGGGTC GCTTGGTGGT GGTGGGGACA
AGTGCCGTTG TGTTGGCGGT TGGCTTGTAT TACAGCAATT TCTTGGGCGA TCCAACCCTG
GCCAATAATG GCGTGATTTG CCCAGCGCCA CGTCCGTTTG ACCAAAAATT GTGGGGCGTA
GTTTGGAACG ATCTGATTGC GCTTGATGGG CGGGTTCCGG CATGGTTTTG GTTGGTGGGT
TTAGGCGGTG CGTTTAGTTT GCGCCAAGGG TTATCACGGC TAGCAACTCC AATTTGGGCT
TGGCTGGCAA CCTTCGTGCT TTCGCTTAGC TCGTTGCTTT GGTCGGAGCA AACTGTGCGT
TGGTGGCTGT TTATCTTGCC CGCCTTAGCT TTGAGTGGTG GCGTAGGTTT AGCGACTTTA
GCCCAGCGTG GCCGTTTCGG GCGGGTAGCT GCGATTGCCG CGAGTTTATT CATCATTGCT
GCTTCGTTAG CCCTCTGGAC ACGCTTTATT ATCGAGTATC GCACGGGGGC GTTTGTTCCA
TAA
 
Protein sequence
MALGRYARPL LLSSVVAGLL LWLGWQRLVP FNLAIGGDLV IGESGEQFAL MYDQPYLEHV 
HAPEPATVDL TTTETYRWTQ PESAITVPYL NGSAHIVRLS LAPPSVPQTP FLLQANGAAV
RTDLPPGQRT LHLFAPASSD GSLTLELQAP TYNASPDPRL LGVVLYRMQA QPLSHDWFLP
WAAWIDLLLV VIVVGLGAAL AGLAPLTAAG AILVTSSGLS VLLATVRTVI TLDTGQLRTI
SLACLLVAWL GRWLAERRQN PEIALVAGMT ALGLALRLIG IRHPQTNFSD LLLNVNNLAS
VGSGDLLFTE GLPCAAGAGR SPYPPGTYLI TQPLSLLLPA SMNRGILIQI VGAVADALVI
PLLWWLIDRT RDARTPARAA LWAASLYLAP LAMLRAMVIG EWSNVLGQAI AMPILAWLGL
WLASNQPRAW QPALIAGLTI AALQHSGTML SLGLWGVALA GFLVWQKQWQ VLGRLVVVGT
SAVVLAVGLY YSNFLGDPTL ANNGVICPAP RPFDQKLWGV VWNDLIALDG RVPAWFWLVG
LGGAFSLRQG LSRLATPIWA WLATFVLSLS SLLWSEQTVR WWLFILPALA LSGGVGLATL
AQRGRFGRVA AIAASLFIIA ASLALWTRFI IEYRTGAFVP