Gene Haur_3597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3597 
Symbol 
ID5735458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4527168 
End bp4528388 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content51% 
IMG OID641280746 
ProductSH3 type 3 domain-containing protein 
Protein accessionYP_001546361 
Protein GI159900114 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.400184 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTCAAG TTGCCTTAAC ACCACGTTCT CAGCCAACGA CTGTTTGGCA GTTACGGCGC 
AAAAAAGTCC CCGATCAACT TTCGTTGCCT GCTGATTATG TTCCAACTGA CTATACGTCG
TTGCCAACTC CCGAAGAGCC AAAAAAAGGT GGCGGTCTAG GGATGATTTT GGTTGTGGTG
GTCTTGTTCT TGGCGTTGGG GGCAATTGCC TATGCTGTTT GGGGGCAGGG TAGCGGCGAG
GAAGATGTTG TTCCAACCAA AGTGCCAACC ACCGCGCTAA CGATTGATCG GGCTAGTATG
ATCGTGAGCG ATGGCAAATT GACGGCTCAA ATCTCCATTA CCACAGATGC ACCTGATGAG
AGTCAAGTGG GAGCGATCTT ATTGGAAGAT GGCCGCCCAT TTGAATTTTT TGATACCGAT
GCCATGACCA CCACTGTCAG CGGCGGCAAG GCTCGCTTGA TTATTCCTGA AATGGAAGGT
CACGAAGATG GCCGCAAATC GTCGGAATAT ACTGTCCAAG TAACCGTTGC GAGCGCTGAT
GGCGATGTTC TAGCCAAAGC CGACGAAGCG ATTGAAATCA AGGGCGATGC GCTTGATCGT
TTCTTGGGCG ATACCGCAGT AACTCCAATC GATGTAACCC CAACCATCAC TGATCCAATT
TCAGGCACGA TGGTACCAAC GCCTGCTGAT GGCTCGACTC CAGTAACCCA AGTTACGCCA
GTTCAGCCAA CTGCCGCACC TGGTGGCCTG CCCGTGCCGT TGGATAATGT GGTGATCAGC
CGCCCAGGCA TTGTCTATAC CACGCCGTTT GGCCCAGCCA ACCAACGTGG CAACGTAACT
GCTGGCGAAA TTGCGCGAAT TGTCGTCAAG ATGCCAGTGA ATGGTGAAGT TTGGTATTTG
GTCGCGATCA GCCAAAGCGG TCAATCAGGC TGGCTCAATA GCAGCACGAT TGACTTACCT
GCAACTGAAG TCAACAAAAT TACCCCTGTT AGCGGCGATG CACCATTTGC CGTAGCCTTC
AATGGCGGTA ATGTGCGCTC AGCACCTGGG GGCGATGTTT TGACCCAAGT TGATGCTGGG
GTCAATGTCT CGCTGATCAA CCGCAGCAGC GATAGCGCTT GGTTCAAAAT CAAGTTACCA
AATGGTAGTG AAGGATGGGT CGTTGGTCAG ATCTTGACTA TCAACCCTGC GGTATTAAAT
ACCATCCCTG TAGCACCCTA A
 
Protein sequence
MSQVALTPRS QPTTVWQLRR KKVPDQLSLP ADYVPTDYTS LPTPEEPKKG GGLGMILVVV 
VLFLALGAIA YAVWGQGSGE EDVVPTKVPT TALTIDRASM IVSDGKLTAQ ISITTDAPDE
SQVGAILLED GRPFEFFDTD AMTTTVSGGK ARLIIPEMEG HEDGRKSSEY TVQVTVASAD
GDVLAKADEA IEIKGDALDR FLGDTAVTPI DVTPTITDPI SGTMVPTPAD GSTPVTQVTP
VQPTAAPGGL PVPLDNVVIS RPGIVYTTPF GPANQRGNVT AGEIARIVVK MPVNGEVWYL
VAISQSGQSG WLNSSTIDLP ATEVNKITPV SGDAPFAVAF NGGNVRSAPG GDVLTQVDAG
VNVSLINRSS DSAWFKIKLP NGSEGWVVGQ ILTINPAVLN TIPVAP