Gene Aazo_1133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1133 
Symbol 
ID9338928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1217250 
End bp1218347 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content41% 
IMG OID 
Productheat-inducible transcription repressor HrcA 
Protein accessionYP_003720588 
Protein GI298490411 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.382582 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGTCC AGCTGACAAA TCGACAACAG CATATACTTT GGGCAACTGT GCGCCACTAT 
ATTGCTACAG CAGAACCTGT TGGTTCTAAA GCCCTAATTG AAGAATTTGA CCTTGGTGTT
AGTTCCGCAA CCATCCGCAA TGTCATGGGC GTTTTAGAAA AATCAGGATT ACTTTACCAA
CCACATACTT CTGCGGGTAG AATACCTTCT GATTCTGGTT ATCGGATTTA TGTTGATCAA
CTTATTACAC CTTCTCTGCG AGACCCTACA CGAACAGAAG CTTTAACTAA AGAAGTGGAA
AATGCTTTAC AACAACGTCT CCATTGGGAA GATTGGAGTT TGGAAGCTTT TTTACAAGGT
GCGGCTCAAA TTTTGGCAAC TTTGAGTGGC TGTATTACCT TGATTACTAT GCCACAAACA
ACTACAGTGC AGTTAATACA TTTGCAATTA GTGCAAATTG AAGGTGATAG AATTATGCTG
ATTGTGGTGA CAGATAGTTA TGAGACACAT TCTAAGGTGA TGGATTTGTT CACTGCGTCG
TCAGAAACTA AACCTGATCC AGCAGTAATT GATCACGAAT TACAGATTGT TTCTAACTTT
TTGAATAGCC ATTTACGAGG ACGAAGTTTA TTAGAATTAG CCAAACTGGA TTGGAGTGAA
TTAGATCAAG AGTTTCAACG CTATGGAGAA TTCTTGAAAA ATTCAGTTGC AGAATTAGCG
CGTCGGACCG TGGTACCAAA TGCAACACAA ATTATGGTGA GGGGTGTGGG TGAGGTGTTA
CGTCAACCAG AGTTTTCTCA AGTACAACAA GTACAAACTA TCATCCATCT TTTAGAAGAA
GAACAAGAGC AATTATGGCG GTTAATTTGT GAAGAATCAG ATGTTGAGGA AATGGGTAAG
CCAAGGGTGA CAGTGAGAAT TGGGACAGAA AATCCACTAG AACCGATTCG GACTTGTTCA
TTAATTTCGT CTACTTATCG TCGGGGTTCT ATCCCTGTGG GAAGTGTAGG TGTTTTGGGT
CCAACTCGGT TAGACTATGA AGGTGCGATC GCAGTTGTGG CAGCCGCAGC AGATTATCTA
TCGGAAGCTT TTAGTTAA
 
Protein sequence
MQVQLTNRQQ HILWATVRHY IATAEPVGSK ALIEEFDLGV SSATIRNVMG VLEKSGLLYQ 
PHTSAGRIPS DSGYRIYVDQ LITPSLRDPT RTEALTKEVE NALQQRLHWE DWSLEAFLQG
AAQILATLSG CITLITMPQT TTVQLIHLQL VQIEGDRIML IVVTDSYETH SKVMDLFTAS
SETKPDPAVI DHELQIVSNF LNSHLRGRSL LELAKLDWSE LDQEFQRYGE FLKNSVAELA
RRTVVPNATQ IMVRGVGEVL RQPEFSQVQQ VQTIIHLLEE EQEQLWRLIC EESDVEEMGK
PRVTVRIGTE NPLEPIRTCS LISSTYRRGS IPVGSVGVLG PTRLDYEGAI AVVAAAADYL
SEAFS