Gene Haur_5103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5103 
Symbol 
ID5737061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp135905 
End bp137149 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content66% 
IMG OID641282268 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_001547859 
Protein GI159901613 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.116678 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCATC CTAGCACACC ACCGACCGTC ATTGCAACCC TTGACATCGC CAAACATACG 
CATTGGTTCG CTGTCTTTGC GCCCGATCTC ACCCCGATCA TTCCACCCCA CCCCATCACC
ACCGATGCGA CTGCCTTGCA ATCGGTGATC ACCACCCTCG CCCAGCTTGC GCTTGCTGGC
CCGGTCGCGC TCGCCATGGA GCCAACCAGT ATCTATCATC TCCCCTGGCT GCATGCCCTC
ACGGCGGCAC TCCCGCCCAC CGTCACCTGC CTGCTCGTCC ACACCACCGC CGTGCACCAT
GCCCGAACCC GCCTGACGGC AGGTCGGCTC CGCAAAACCG ATGCCCGCGA TTGCCATGCC
ATTGCCGCTG CCGTGCGCGA TGGTCATGGC CGCCCGTGGT CGCCGCCCTC CCCCCAGCAG
GCCCAATTCC GCACATGGGC CGCCCAAGAA GCCGCCACCA TGGAGACCCT TACCCAGCTC
GCCCACGCCC TCCAGCGCCT GACCGATCTG CTCTGGCCCG GCTTGGTCGC TCGCCGCAAC
GCCGCCAGCA CACCGTTGGT CTCCTCGCGC CTATGGACGC GCCACATCAT CCAGACCATC
CTGCTCCACC ACCCCGATCC CCATACCTGG CGGTCGCTCT CGGTCGCCGC GATCCGCGCC
CGCCTGAAAG CGCTCGGCAT GCGCTGTGGG ATCGGTCGCG CCACCCACCT CGCCGCCATT
CTTGCCGCAC AGGTCGTGTT GCCACCGGAG CAAACGCCAC CCTTAGCGGC TCGCCTGACC
ACCCTGATGC AGCAGTATTG TCACCACGCC ACCCACCTCG CCGCCCTGCA TGCCGAGGCC
GAAACCCTCG TTGCTGCCTC ATGGGCAGCG GTCTTGTGCT CCATCCCGGG CATGAGTCCA
GTGCTGGCGG CGCGCTATGC GGCGGCGGTC GGCGATATCC ACACAATCAC CTCGGCCAAA
GCCCTCTGGT CGCTGGCGGG ACTCGAACCA AGCATCTATG CCAGTGGCGT GCGATCCCGC
GTTGGGCAAT CCTCGTTGGC TGGCCGGATC ACGCTGCGGC AGGCCTTGAT CCGCATTGGC
GCAAGCCTCA GTCGCCATTG TCCACCTGTG CGGGCCTGCT GGCTCGCGGC GCGGGCGCGG
CGCAAACCGC TCGCTGTTGC GCTGATTCAT GCCGCGAACA AGGCCAATCG GCTGCTGTTT
GCACTGGCGA TCAGTCAACA ACCCTATCAG CCCGGCAGGG CGTGA
 
Protein sequence
MSHPSTPPTV IATLDIAKHT HWFAVFAPDL TPIIPPHPIT TDATALQSVI TTLAQLALAG 
PVALAMEPTS IYHLPWLHAL TAALPPTVTC LLVHTTAVHH ARTRLTAGRL RKTDARDCHA
IAAAVRDGHG RPWSPPSPQQ AQFRTWAAQE AATMETLTQL AHALQRLTDL LWPGLVARRN
AASTPLVSSR LWTRHIIQTI LLHHPDPHTW RSLSVAAIRA RLKALGMRCG IGRATHLAAI
LAAQVVLPPE QTPPLAARLT TLMQQYCHHA THLAALHAEA ETLVAASWAA VLCSIPGMSP
VLAARYAAAV GDIHTITSAK ALWSLAGLEP SIYASGVRSR VGQSSLAGRI TLRQALIRIG
ASLSRHCPPV RACWLAARAR RKPLAVALIH AANKANRLLF ALAISQQPYQ PGRA