Gene Haur_4040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4040 
Symbol 
ID5735902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5157032 
End bp5158174 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content50% 
IMG OID641281191 
Productcell cycle protein 
Protein accessionYP_001546800 
Protein GI159900553 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG0772] Bacterial cell division membrane protein 
TIGRFAM ID[TIGR02210] rod shape-determining protein RodA 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.283521 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCACTA GCATTCGCGC TCGTTCTTGG CGCGAGTTTA ATCCAATTAT GGTTGTTGCG 
GTCTTGCTCT TGCTGGCAAT TAGCGTACCA ATGGTCTATA CAACCACGGT TGGAGCCGCC
GGAACCTTGG TGTTTGGGCT AGGTTCTTCG TTTGCTAAAC ATATTGTCTG GGTCAGCATG
GGCATTAGTC TGATGTTTGG TCTGGCCATG GTCGATTATC AATTGCTGCG TTCGTTAGCG
ATTGTTTTAT ATATCGCTGC GCTTGGGCTT TTGGGCATGG TGGTGGCGTT AGGCCAAGTT
AAATATGGTG CGCAAAGCTG GATCGGCTCA AGCCAACTTT CGTTTCAGCC AACCGAGCCA
GCCAAACTGA TGGTGATCAT CGCGCTTGCC GCATTTTGGA GCAAGCATGG CGATGAGCCT
AGCCCTTGGA AATCGGTCTT TATCTCGTTG GGAATTTTAG CCGTACCCCT TGGCTTGGTT
ATGCTACAGC CTGATTTTGG CTCAGGCATG GTGATGATCG GCATTTGGCT AGTGATGTCG
TTGGTTGCCA ATACCCGTTG GGTACAATAT GGCATCTTGA CCCTGTTCAG TGCGCCGGTG
GTCGTCTTAG CATGGCTCAA ATTTGATGAA TATCAACGCG AACGCTTGAC CGTGTTTCTT
ACTCCTGAGC GTTGCGAAAC CGATTTAGAG TTTCGGATGC GAGCATGTTG GCAAATTATT
CAATCGCGTT TGGCAATTGG CAATGGTGGC CTTGGCGGCA TGGGCTTGTT GCGCGGGGTG
CAAAGCCAAT TGAACTATTT GCCCGTTCAA GAGAGCGACT TTATTTTCGC GGTTACGGCG
GAAGAGTTAG GCTTTATTGG CGCGGCAGTC GTGATTGTGT TGCAATTAAT CATCATCTGG
CAAATTTGGC GCGTAGTTGA GCGAGCACGT GACCCTTTTG GGCGTTTGAT GGCGGCTGGG
GTTGCTGGCC TGTTGTTGGT GCATTGTCTC GAAAATATGG GCATGAACTT GATTATGATG
CCCATGACTG GAATTCCGCT GCCTTTTCTG AGCTATGGTG GCTCGTTTAC CCTGACGGTT
TTGATGGGCA TCGGTGTAGT GCTAAGCGTC TCGATTCGCA GTAAACGTTG GTCATTTAAT
TAA
 
Protein sequence
MSTSIRARSW REFNPIMVVA VLLLLAISVP MVYTTTVGAA GTLVFGLGSS FAKHIVWVSM 
GISLMFGLAM VDYQLLRSLA IVLYIAALGL LGMVVALGQV KYGAQSWIGS SQLSFQPTEP
AKLMVIIALA AFWSKHGDEP SPWKSVFISL GILAVPLGLV MLQPDFGSGM VMIGIWLVMS
LVANTRWVQY GILTLFSAPV VVLAWLKFDE YQRERLTVFL TPERCETDLE FRMRACWQII
QSRLAIGNGG LGGMGLLRGV QSQLNYLPVQ ESDFIFAVTA EELGFIGAAV VIVLQLIIIW
QIWRVVERAR DPFGRLMAAG VAGLLLVHCL ENMGMNLIMM PMTGIPLPFL SYGGSFTLTV
LMGIGVVLSV SIRSKRWSFN