Gene Haur_3653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3653 
Symbol 
ID5735514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4593973 
End bp4595502 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content48% 
IMG OID641280802 
Producthypothetical protein 
Protein accessionYP_001546417 
Protein GI159900170 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000579216 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCGAA CACGTTCCGT GATTGTTGGT TTGGTGGTAT TGCTGTTGGT AGCGCTTGGT 
GCTAGCGGTT GGCTCTGGTT TGAACGCGGC ACAATTATCT CGCGCGATGC CACGACCCAA
GCTGAATTTG GCAAATTAAA AGATCAAATT ACTGCTGCTG ATGAGGTGCA AGATAAAAAC
CGTCAGCTGC AAGACCAAGT GAATGATTTA CAAGAGCGTT TGAATAACCC ACCAACCAGC
ATTGCTGAGC CGACCAGCCC AGCGCTTGAG CCAACGCCTG AAGCTGGCCC GACTCCGACG
GTTGATCCGG CTGCCCCGAC CCCAACCGGC CCAGGTGGCG TGGAGCCGCC AGCACAAATT
GTTGAAGTGA TGAAGCAAAT TGAGCAAGAA GTGATCGCGT TGCGTGGTTT GCCAGAAGAA
CGCCCCGTCA CACGACGCAT GCTCACCCGC GATGAATTGC GTGATTATAT TGTGCGCGAA
ATGGAAACCG AAAATACTCC CGAAGATTTT CGCCGTGAAA CCAGCCAATT GTGGATGCTT
GGTTTAGCCG AAAAAGATAT TGATTTGCAA CAACTCTATA TTGATTTGCA AACTGAGCAA
ATTGGTGGAT TTTATGATCC CGAAACTGAT ACCTTCTATA TTATTGCTGA AAACAGTGAA
TTTCCACCAG CTGATCAAAT TACCTATGCC CATGAATTTA ATCATAACTT GCAAGATCAA
TTGATTAATC TGCAAGATGG CCTGAAAGTT GGCGAATTTG ATGCTGATCG ATCTTTAGCA
TTTCGTTCGT TGGTTGAAGG CGATGCAACC AAATTAATGA GCGATTGGTT GCAAAACGAT
CTTATTCCAC GCATGTCACC TGCCGAGTTG CAAGAATTAT TGCGCACCTT GCAAGAACAA
CAAGATAGCA GCAGCATTCT TGATCAAGTG CCTGGCGTGT TGCGCGATGG GCTAGTCTTT
CCTTATGAAG ATGGTTTAGC GTTTGCTGAA GCAGTTTATG CCGAAGGTGG TTGGGAGGCA
GTGACTAAGG CGTTGCAAGA CCCACCAACC TCGACCGAGC AAATTTTGCA CCCTGAAAAA
TATTTGAGTG CCACCCGCGA TAACCCAACC CTGCCCGATC AATTTGATCT GTTGCCAGTG
CTCGGTGCTG ATTGGACAAC CGCTATGACC AATACGGTTG GCGAGTTCGA TGTTAAAGCG
TTGCTCGAAT ATACCGCGAC TGCTGGCGAT ATGGAAGCTG CGGCAGCAGG AATTGGCGGC
GGTCGCATGA CCCTGTATGA ACACAACAGC GATTTCACGC CTGTGTTGCA ATGGACATTG
CGCTGGGATA GCGCCGCCGA TGGTGATGAA TTTTTGAGCT TATTCAATGG TACGCTTAAC
CCAAATGGCG ATTTGCTGCT ACGGGCTGGA GATCCAAACC GAAGTGATGA TGATGTCCAT
GTTGGAGTCA AAGGCAGTGG CCAAGAATTT GTGATTATTT TTAGTTCGAA CCAAGATTTG
GTGCGCAATG CCTTGAATGC CTTACCCTAA
 
Protein sequence
MNRTRSVIVG LVVLLLVALG ASGWLWFERG TIISRDATTQ AEFGKLKDQI TAADEVQDKN 
RQLQDQVNDL QERLNNPPTS IAEPTSPALE PTPEAGPTPT VDPAAPTPTG PGGVEPPAQI
VEVMKQIEQE VIALRGLPEE RPVTRRMLTR DELRDYIVRE METENTPEDF RRETSQLWML
GLAEKDIDLQ QLYIDLQTEQ IGGFYDPETD TFYIIAENSE FPPADQITYA HEFNHNLQDQ
LINLQDGLKV GEFDADRSLA FRSLVEGDAT KLMSDWLQND LIPRMSPAEL QELLRTLQEQ
QDSSSILDQV PGVLRDGLVF PYEDGLAFAE AVYAEGGWEA VTKALQDPPT STEQILHPEK
YLSATRDNPT LPDQFDLLPV LGADWTTAMT NTVGEFDVKA LLEYTATAGD MEAAAAGIGG
GRMTLYEHNS DFTPVLQWTL RWDSAADGDE FLSLFNGTLN PNGDLLLRAG DPNRSDDDVH
VGVKGSGQEF VIIFSSNQDL VRNALNALP