Gene Haur_5164 details

Gene Information

Locus tag: Haur_5164
Symbol:
ID: 5737122
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 239251
End bp: 240375
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 57%
IMG OID: 641282329
Product: putative transposase
Protein accession: YP_001547920
Protein GI: 159901674
COG category:
COG ID:
TIGRFAM ID:
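
The start, end, gene length and protein length fields in this record are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither assumption is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 239251
END_BP = 240375
GENE_LENGTH_BP = 1125
PROTEIN_LENGTH_AA = 374


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1125
print(expected_protein_length(GENE_LENGTH_BP))  # 374
```

Under these assumptions both derived values agree with the listed gene length (1125 bp) and protein length (374 aa).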


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTTCCC TCAACGCCAT TATCCAGCGT TTTGGTGCAT CGTATCGGAC TCACTGCCAT 
GGGCGACTCT CGGTGCAACA ACGACGGGTC ATCAGCGCGA TCGCAGCCTG TCGGACAGAG
ACCCTTGGTG GTCAGGTCTT CACCTGTCCT ACCTGTCAGA CGACTCGCTA TAGCTACCAT
TCGTGTCGTA ATCGCCATTG TCCGACCTGT CAACAGGATG CTGGAGCCGC ATGGTTGGCC
GACCAACAAG CGCTGCTGCT GCCCGTTCCC TATTTCTTAG TCACGTTTAC GGTGCCTGCC
GAACTGCGAC CAATCGCTCT TACCAATCAA GCGCTGCTGT ATGCGGCCAT GTTTCGGGCA
TCGGCTGCCG CACTCCAACA ACTTGCCGCC GATCCGCGCC ACTTGGGTGG CCAATTGGGG
ATGCTCGGCA TCTTCCAGAC CTGGACGCGC GATTTGCGCT ACCATCCGCA CATTCATTAT
TTGATCCCCG GCGTTGGACG AACCACTGAC GAACGGATTG TCTTTCCTCC TGCTCCAGAT
TTTTTGCTTC CTGTTCGCCC CTTAGCCATG ATCTTCCGCG CCAAACTTCG CGCCGCGCTA
CGCCAAACAG CGATCGCTGC GACCATTCCC TCGACGGCGT GGGAGCATGA CTGGGTGATT
GATTGCCGTC CCGTGGGCAC CGGTGAAACA GCGCTCAAAT ATCTCGCTCC GTATATTTTC
CGCGTGGCGA TGAGTAATAA TCGCATCGTC AGCGCTGATG AGACACAGGT CACCTTTCGC
TATCGGCACA GCGCGAGTGG CGAAAACCGA ACGAGCACAC TCCCAGTGGA GACCTTCCTT
GATCGTTTTG TTGCCCATAT TTTGCCAAAA GGGTTTGTCA AAGTGCGCTA TTATGGTTTT
TTTCGGACAG GAGTCCGCGC GAGCCTGCGA CGCATTCGGG CACAATTGAT GCTCTTCCGC
AGCCACGATC TGCTGGATCG GGCGATTCCG CAACCAAAAC TGTCGGCTCA GACCCACCAG
CTGAGCACAT GCCCGGCCTG TGGATCACTG ATGCACGGTC GGCAAATCGT CTCCAGTCGC
ACACGTGCCC CGCCCCATGG GATGCATCAT CCTCGTTCTG CGTGA
 
Protein sequence
MISLNAIIQR FGASYRTHCH GRLSVQQRRV ISAIAACRTE TLGGQVFTCP TCQTTRYSYH 
SCRNRHCPTC QQDAGAAWLA DQQALLLPVP YFLVTFTVPA ELRPIALTNQ ALLYAAMFRA
SAAALQQLAA DPRHLGGQLG MLGIFQTWTR DLRYHPHIHY LIPGVGRTTD ERIVFPPAPD
FLLPVRPLAM IFRAKLRAAL RQTAIAATIP STAWEHDWVI DCRPVGTGET ALKYLAPYIF
RVAMSNNRIV SADETQVTFR YRHSASGENR TSTLPVETFL DRFVAHILPK GFVKVRYYGF
FRTGVRASLR RIRAQLMLFR SHDLLDRAIP QPKLSAQTHQ LSTCPACGSL MHGRQIVSSR
TRAPPHGMHH PRSA
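
A minimal sketch of how the protein sequence can be derived from the gene sequence, and how a GC fraction can be computed. It is not the database's own pipeline. It assumes the gene sequence is given 5' to 3' on the coding strand and uses the standard codon assignments, which translation table 11 shares (table 11 differs mainly in which codons may act as start codons; this gene starts with ATG, so that difference is not handled here). The helper names are hypothetical.

```python
# Standard codon assignments, in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("ATGATTTCCC TCAACGCCAT TATCCAGCGT"))  # MISLNAIIQR
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared against the protein sequence and the GC content listed in this record.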