Gene Haur_4504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4504 
Symbol 
ID5736355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5766968 
End bp5768338 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content50% 
IMG OID641281667 
ProductPT repeat-containing protein 
Protein accessionYP_001547264 
Protein GI159901017 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCAA AATTGCAACG ATGGATAACG CAGGTTATAC TCCTTGTAAC GATCGTGGTG 
GCTGGTGGAG TTAGTACCAC CTTTGCCAAT GTGGGCCAAG CAATCGATTC AGCAGCGGTA
GACACAGTAG TTGAGCCAAC CGCTGAACCA AGTGTTGAGC CAACAGCCGA GCCAACCACT
GAGCCAACTG CCGAACCAAC TGCAACCGAT CTGCCTAGCA CGCCGATTCC AACCGTAACA
CCTGTTTGTA ATCCTCAGAT TGATCTGTCT GGTTGGTTCT CCACGAACTC TCTTGGCAAA
ATTCAGAATA AATCGACTAC TTGTGTCTAC ACGGTGGGCA TGGCTTCGTA TCAAAAAGTC
GATGAAATTA TTGATCATCA AGTGATTTAT TCGTGGGAAA CCGGACGAAT TGAGCCAAAC
CAAATTCTAG CCTTGAACGT TGCAGTTCCT GAGTGTGCCG CCCAAATTGA TTTATTTTAT
GGGCCAGTTT TACACTCGCT TGATGGGCAA CGCTATGGTG AGCGATTAAT TACTGCTCGT
CATACTGGCG GAATTAACTA TTGTGGTCTT GCTGCGCCAA CCAGCACCGT TGAGCCAACC
GCAGAACCGA CGGCAACCAG CACGCTTGCG CCAACCGCAG AACCGACGGC AACCAGCACC
GTTGAGCCAA CCGCAGAGCC AACGGCAACC AGCACGCTTG CGCCAACGGC AACCAACACG
CCAACCGCAG AACCGACGGC AACCAACACG CCTGCGCCAA CGGCAACCAA CACGCCAACC
GCAGAACCGA CGGCAACTCA TACGCCTGCG CCAACTGCCA CCAATCTACC AACATCTACC
GCCACCAGAA CACCAACGGC AACGCCGACA ACACCTCCAA CCAGAACACC AACGGCTATT
CCAAGCCCAA CATCAACCAA AACACCAACT CCAACGGCAA CGATTCGACC AACCTCTACG
CCAACCAGAA CACCAGCACC GACTGCCACC AATACGCCAA CTGGTGCAAA TTGTACCTAT
ACTGATGGCT ATTGGAAGAC CCATCCGCGG GAGTGGCCTC TAGGATCGAT GATGCTCGGT
GGAGTTCAAT ATAGCCAAAA TCAATTGATG GCGATTTTTA TTATGAACGT TAGAGATGAT
ATGAGCTATA CCCTAGCGCA TCAATTGATT GCTGCCAAAT TGAATGTGGC CCAAGGTGCC
GATGGTAGTC AGATTAATGG CACTATCGCT GCTGCTGATA TGTGGCTCGA GCAAAATCCT
TTGGGCAGCA AGCCGACTGG TTTTATTGCA ACCACTGGCA CTGGCTATAG CTCGACCTTA
AATAGCTTCA ATAGCGGTTT ACTTGGCCCT GTTCACTGTA ACAACTATTA A
 
Protein sequence
MNPKLQRWIT QVILLVTIVV AGGVSTTFAN VGQAIDSAAV DTVVEPTAEP SVEPTAEPTT 
EPTAEPTATD LPSTPIPTVT PVCNPQIDLS GWFSTNSLGK IQNKSTTCVY TVGMASYQKV
DEIIDHQVIY SWETGRIEPN QILALNVAVP ECAAQIDLFY GPVLHSLDGQ RYGERLITAR
HTGGINYCGL AAPTSTVEPT AEPTATSTLA PTAEPTATST VEPTAEPTAT STLAPTATNT
PTAEPTATNT PAPTATNTPT AEPTATHTPA PTATNLPTST ATRTPTATPT TPPTRTPTAI
PSPTSTKTPT PTATIRPTST PTRTPAPTAT NTPTGANCTY TDGYWKTHPR EWPLGSMMLG
GVQYSQNQLM AIFIMNVRDD MSYTLAHQLI AAKLNVAQGA DGSQINGTIA AADMWLEQNP
LGSKPTGFIA TTGTGYSSTL NSFNSGLLGP VHCNNY