Gene Haur_4544 details

Gene Information

Locus tag: Haur_4544
Symbol:
ID: 5736940
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5814886
End bp: 5816031
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 51%
IMG OID: 641281706
Product: hypothetical protein
Protein accession: YP_001547303
Protein GI: 159901056
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
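
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(5814886, 5816031)
print(length)                  # 1146
print(protein_length(length))  # 381
```

Both results agree with the Gene Length and Protein Length fields listed for Haur_4544.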


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCAAAG AATTAACCGT GCGAGTGAGT GCCCGCACCG TCGGCTGGAC ACTGCTGATT 
TTTTCAGGGG TCTGGATTAC GATTCTGCTG AATCATGTGT TGGTGCTGTT TTTTGTGGCG
GTGCTGCTAG CAGTGGCAAT TTCGGGGGTG GTGCAACGCT TTGAACAATT GCGCATCGCC
CGCCCCATCA CAATTTTGGT GATTTATACA ATCATTATTG CGATGTTTAT TAGTTTGGGC
TTTGTGCTAG TGCCAATGGT TAGTCAACAG GTGCGACTTC TGGCTGAGCA ATTCCCTAAT
TTGGTGCGCC AACCAACCCA ACAAGCTAGC GCTTGGCTGG CCCAACAGTT TCCAACCTTG
CGTGTACCCT TGCCCACTGG CGATTTGGCT GGTCAGGCGG CACATTACGC GGGTACAGTC
GTTGGTGGGT TTAGTGGCGC AGCCTTCACT TTTGGGCGCA CCTTGATGGG TGTGATTATT
AATTTTATTG TGGTGTTGGT TTTAGCTTTT TTCCTGGTTA GCCGCGCCAA TGTTGCCAGC
AATTTTATCA AATTGATGAT TCCCAATCGC TTTCAAGAAC GCTTAATCAA TGTGACCAAT
GTGATTGGCC GCCGCCTTGG GCGTTGGGTT TGGGCGCAAC TGACAGTTGC CACCTTCTAT
GCCGTTTGTT TTGGTATGGG CTTGTGGATG TTGGGCGTAC CCTATCCGGT CGCCTTAGGC
GTAATTGGCG GCATGCTTGA GCTAATTCCC TATGTTGGCG GTTTCGTGGC CACCATTCTG
ACCATGCTCG TGGCCTTCAC GGTGCAGCCG ATGTTGGCGG TTTGGGTGCT GGTGTTGCAT
TTGATTGTTG GCAATATTGA AGTGCATATC ATTGCGCCAA AAGTCATGGG CCACGCAGTC
GAAACCCATC CAGTGATCAC GATTTTGGCC TTGTTTAGCG GGATCGAGCT TTTGGGGATT
ATCGGCGGGG TGATTGCGAT TCCATTGGCG GTGGTTGGCC AAGCATTGGT TGAAGAATTT
TGGATCAAAC GGATTCGTGA AGCCCAACCT GCGGCGGAGT CGCTAGCCGT TCAGGCTAAA
GCTCCAATCG TGCGCCGAAC TCAATTGCGT CGCCGCCCCA CATTGCGCAA ACGTCAAGGG
ATCTAG
 
Protein sequence
MPKELTVRVS ARTVGWTLLI FSGVWITILL NHVLVLFFVA VLLAVAISGV VQRFEQLRIA 
RPITILVIYT IIIAMFISLG FVLVPMVSQQ VRLLAEQFPN LVRQPTQQAS AWLAQQFPTL
RVPLPTGDLA GQAAHYAGTV VGGFSGAAFT FGRTLMGVII NFIVVLVLAF FLVSRANVAS
NFIKLMIPNR FQERLINVTN VIGRRLGRWV WAQLTVATFY AVCFGMGLWM LGVPYPVALG
VIGGMLELIP YVGGFVATIL TMLVAFTVQP MLAVWVLVLH LIVGNIEVHI IAPKVMGHAV
ETHPVITILA LFSGIELLGI IGGVIAIPLA VVGQALVEEF WIKRIREAQP AAESLAVQAK
APIVRRTQLR RRPTLRKRQG I
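
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch in Python, assuming the gene sequence is given on the coding strand in the 5' to 3' direction. Translation table 11 uses the same amino acid assignments as the standard genetic code (it differs in the permitted start codons), so the standard assignment string is used here. The `translate` helper is hypothetical, and only the first 21 bases and the last 6 bases of the gene sequence are shown.

```python
from itertools import product

# Codon table in the NCBI ordering (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    cleaned = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases of the gene sequence.
print(translate("ATGCCCAAAG AATTAACCGT G"))  # MPKELTV
# Last 6 bases: one isoleucine codon followed by the TAG stop codon.
print(translate("ATCTAG"))                   # I
```

The two outputs match the first seven residues and the final residue of the protein sequence listed above.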