Gene Haur_4461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4461 
Symbol 
ID5736312 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5704057 
End bp5706006 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content53% 
IMG OID641281624 
Producthypothetical protein 
Protein accessionYP_001547221 
Protein GI159900974 
COG category[S] Function unknown 
COG ID[COG1543] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.608634 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAAAC AAGGTGCGTT CACGTTTGTC TTACATAGCC ACTTGCCCTA TTGTCGCAAG 
GCTGGCCGCT GGCCTCACGG TGAAGAATGG ATTCACGAGG CCGCTTCCGA GACGTATATT
CCACTACTCA ATGCGCTCAA CGATCTGATC AACGATGGGG TTACACCACG CTTAACGATT
GGGATTACGC CAATTCTAAC CGAGCAGCTT GCTGACCCCA CCATTTTGCA CAATTTTGAA
GAATATCTTG ACGAGAAGAT CACTGCGGCG CAAGCTGATA TGGATCGACT GGCCGACGTT
CAGGCAGTTT GGGATGCTGC CCAAGTTGCC GAACCTGAAA CCGAGCCAAC TCCCCTGCTC
TCCAGCGAAG AGTTGGAAAG CTTGATCAGC AAAAGCGATG CGCTGCTTTC TTCAACTGCT
GGTGATGCTC CGGCCCCACT TCACGCTGGT TTGCTGAGCG CCACTGCTGC TGCCAGCACC
GAAGCCGAAG CAGATGATGA AGCGGAATCT GAAGAAACCG AGGAAGCAGC AGTTGAAGAA
CCTGCTCCGA TCGAGCAGCC TGATCCACAT AAAGCCTATT TGGCCATGTG GTATCGCGAT
TGGTACAGCA TGATCAAGCG TTCGTTTATC GAGCGCTACA ACCGCGATAT TGTCGGAGCT
TTCCGCCAAT TGCAAGATGC TGGCTATATC GAAATTATCA CCTGTGGCGC GACCCACGGC
TACTTGCCCT TGGTCAGCCG CGATTCAACA ATTTATGCCC AAATTGCCGT CGCTGTCCAA
AGCTACGAAC GTCATTATGG CCGCAAGCCA AAGGCGATTT GGCTGCCCGA GTGCGCCTAT
CGCCCAGCCT ATTATCCTGA AAACCCCAGC GAAACCGAGC GCAAGCCTGG CATCGAAGAA
TTTCTTGAAG CCCAAGGCAT CGAGTGCTTC TTCGTCGAAA CCACCACCAT CGAGGGTGGC
GCACCCATGG ATAAGGCTGA AGGCAAGATT CTTGGGCCAT ATGGCGATAC GTTGCGCCGC
TATGTCGTGC CAGTCAGCCG CGAAATTCCG CCAACTGGCA ATAGCACACT CCAACCCTAC
TTAGTTGGTT TGAGCGATAA AGTTGCGGCA ATTGGCCGCC ATCACAAAAC TGGCTTACAG
GTGTGGTCGG CTGAATGGGG CTATCCAGGC GAGGCTAACT ACCGCGAGTT CCACCGCAAA
GATAGCGAAA GCGGCATGCA ATATTGGCGG ATCACTGGGC CAAAAGTTGA CCTTGGTTAT
AAAGATTACT ATCATCCCGA TTGGGTCAAC GATAAAGTTA ATGCTCACGC CGAGCACTTC
ACGGGCTTGG TACAGCAGGT TATCAGCGAA TATCGCGGCC AAACTGGGCG CTATGGCCTG
ATTTCATCAA ACTACGATAC CGAATTATTT GGTCACTGGT GGTTTGAAGG GGTCGATTGG
ATGCGCGAAG TGCTACGGCG CTTGGCGACA AATCCCGACA TTGATCTCAC CACGGCCTCG
GAATATATCG CCAGCAACCC ACCGCGTGAA TCGTTGAACC TGCCCGAAAG TTCGTGGGGT
TCCAATGGTA CACACCAAAC CTGGCTCAAC CCTGAAACCG AGTGGATGTG GCCAATTATT
CATGCCGCCG AAAAGCGCAT GGAAGGCTTG GTCGCCAGCT ATCCACAGGC AGATGGCGCT
TTAGCCGAAG CCTTGGCTCA AACTGCGCGT GAGTTGCTCT TGCTGCAATC CAGCGATTGG
CCGTTCTTGG TCACGACTGG GCAAGCCCAA GATTACGCCA CCAAGCGTTT CAACGAGCAT
GTCGATCGCT ACAATCAATT GGCTGATGCA ATTGAGGCCA ATGATGTTGG CTTGATGGCT
GAACTAACAG CCAGTTTCAA CGAGCTTGAT AATCCATTCC CCACGATTGA TTATCATGTC
TTTGCCGCTC GCGAAGGCTC AGCAGCCTAA
 
Protein sequence
MPKQGAFTFV LHSHLPYCRK AGRWPHGEEW IHEAASETYI PLLNALNDLI NDGVTPRLTI 
GITPILTEQL ADPTILHNFE EYLDEKITAA QADMDRLADV QAVWDAAQVA EPETEPTPLL
SSEELESLIS KSDALLSSTA GDAPAPLHAG LLSATAAAST EAEADDEAES EETEEAAVEE
PAPIEQPDPH KAYLAMWYRD WYSMIKRSFI ERYNRDIVGA FRQLQDAGYI EIITCGATHG
YLPLVSRDST IYAQIAVAVQ SYERHYGRKP KAIWLPECAY RPAYYPENPS ETERKPGIEE
FLEAQGIECF FVETTTIEGG APMDKAEGKI LGPYGDTLRR YVVPVSREIP PTGNSTLQPY
LVGLSDKVAA IGRHHKTGLQ VWSAEWGYPG EANYREFHRK DSESGMQYWR ITGPKVDLGY
KDYYHPDWVN DKVNAHAEHF TGLVQQVISE YRGQTGRYGL ISSNYDTELF GHWWFEGVDW
MREVLRRLAT NPDIDLTTAS EYIASNPPRE SLNLPESSWG SNGTHQTWLN PETEWMWPII
HAAEKRMEGL VASYPQADGA LAEALAQTAR ELLLLQSSDW PFLVTTGQAQ DYATKRFNEH
VDRYNQLADA IEANDVGLMA ELTASFNELD NPFPTIDYHV FAAREGSAA