Gene Haur_4005 details


Gene Information

Locus tag: Haur_4005
Symbol:
ID: 5735866
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5111356
End bp: 5113224
Gene Length: 1869 bp
Protein Length: 622 aa
Translation table: 11
GC content: 53%
IMG OID: 641281155
Product: asparagine synthase (glutamine-hydrolyzing)
Protein accession: YP_001546765
Protein GI: 159900518
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0367] Asparagine synthase (glutamine-hydrolyzing)
TIGRFAM ID: [TIGR01536] asparagine synthase (glutamine-hydrolyzing)
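
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the coordinates are 1-based and inclusive and that the gene length counts one terminal stop codon; the page states neither convention. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 5111356
END_BP = 5113224
GENE_LENGTH_BP = 1869
PROTEIN_LENGTH_AA = 622


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino acid count, assuming the CDS ends with exactly one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
coords_match = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_match = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 5113224 - 5111356 + 1 gives 1869 bp, and 1869 / 3 - 1 gives 622 aa, in agreement with the listed values.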


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGTGGCA TTACGGGCCA TCTCGAATGG CATGGCACGG CGCAAACCGC CATTCTTGAG 
CAGCAAACCG AGGCGATTCG CCATCGTGGG CCGGATAGTG CGGGCTTTTT TTGCGTTGCT
CAGGTGGCGT TGGGCATGCG GCGCTTGGCA ATTATCGACC TAACCAGTGG CGATCAGCCG
ATGTTTAGTC GTGATGGCGA TTTAGCCTTG GTGTTCAACG GCGAAATTTA TAATTTTCAG
GCTTTGCGCC AGCAATTGCA GCAACTCGGC CAACATTTCG CCACCAACAG CGATACCGAA
GTGATTGTGC GGGGGATTGA GCAATGGGGT ATTTTGGGCT GTGTGCAACG CTTGAATGGC
ATGTTTGCCT TGGCGATTTG GCAGCAACAA GCCCAACGCC TGACTTTGGT GCGCGATCGA
CTTGGGATCA AGCCTCTCTA TTGGTATTCC GATGCTCAGC GCTTGGTGTT TGGCTCGGAA
ATCAAGGCAA TTTTGGCGCA TCCAGCAGTG CCACGCCAAA TCAATCTGCA AGGTTTGGCT
AATTATCTAA GTTTTGGCCA TAGCCTCGCA CCCCAAACCA TGCTCCAAGC TATTTATAAA
GTCGAGCCAG CCCACATTCT GACATGGGAT GTTGCTAGCC GTGCCATGCA AACCCAACGC
TATTGGCAAT TGCAGCCCAA TCCAATCGCG ATTGCGCCCG CCGAAGCTGC CGCCGAATGC
TATAGCCGTT TGCGCGAAGC TGTGCGGTTG CAACTGATTA GCGATGTGCC GCTGGGTGCG
TTTCTTAGTG GCGGGCTTGA TTCAAGTATT ATCGTGGGCT TGATGCAGCG CTTGGGAGCA
GCACCAATCA ACACATTTAG CGTGGGCTTT GAGCATGCAG GCTTCAACGA ATTGCCCGAT
GCAGCACTCG TGGCCCAGCA TTTTGGCACT AAACATCATG AACTGCGCCT GAATGCCAAC
GATTTGGTTG GCGCTTTGCA AACCTTGGTT TATCACTACG ACGAACCATT TGGCGATGCC
GCAGGCTTGC CGGTATATCT CGTTTCACGC TTTGCCCGCG AACATGTCAA AGTGGTGCTG
ACGGGCGAGG GCAGCGATGA GCAATGGGCT GGTTATCGGC GCTATCAAGC CGAATTGATC
GCTCGGATGG CGCAGTATTT GCCTGGGCGT AGCGCAATTG CCGCGCTGAT TCGCCAGTTG
CCCCGTAATC GCCGCCTCAA ACAGGCAATT CGCACCTTAG ATCAGCGTGA TCCCGGGCGG
CGTTACGCTG CATGGCTGAC GATTGCCGAT GTGCAACAGC GCCAACGCTT GCTGCAACCA
CATATCAGCG CAGTGCTAGG GGATTATGCG CCTGAGCAAA TCTATGATCA GGTGTATCCG
CGCAGCGGCG CAGCCATGAC CAACATGGGC TTGGCCGATT TACAAACATG GTTGCCAGAT
ACCTATTTGG AAAAAGTTGA TAAGGCCTCG ATGGCTGCTA GTATCGAGGC GCGGGTGCCA
TTTCTTGACC ATACCTTGGT GGAATGGACG ATGAATTTGC CGCCAACACT TAAACTGCGT
GGCACAAAAA CCAAGTGGTT GCTCCGCCAA GCTTTTGGCG AAATGCTGCC GCAACGCACT
TTGCGCAAGC CTAAACATGG CTTTGCAGTG CCAACCGACC CATGGTTTCG CGGAGCATTG
AGCAACTGGA CTGCCGAAAT TTTGTTTGAT CAACGCACGC TCAGCCGTGG TTTGTTCAAT
CCGCACGAAG TCCAACGGAT CTACCAAGCC CATCGCGATG GCAAAGAAAC CGCAGATACC
GTATTATGGT TACTACTGAA CATCGAACTT TGGCAACGGA TCTATCTTGA TCGCGAGGAA
CGGCTATGA
 
Protein sequence
MCGITGHLEW HGTAQTAILE QQTEAIRHRG PDSAGFFCVA QVALGMRRLA IIDLTSGDQP 
MFSRDGDLAL VFNGEIYNFQ ALRQQLQQLG QHFATNSDTE VIVRGIEQWG ILGCVQRLNG
MFALAIWQQQ AQRLTLVRDR LGIKPLYWYS DAQRLVFGSE IKAILAHPAV PRQINLQGLA
NYLSFGHSLA PQTMLQAIYK VEPAHILTWD VASRAMQTQR YWQLQPNPIA IAPAEAAAEC
YSRLREAVRL QLISDVPLGA FLSGGLDSSI IVGLMQRLGA APINTFSVGF EHAGFNELPD
AALVAQHFGT KHHELRLNAN DLVGALQTLV YHYDEPFGDA AGLPVYLVSR FAREHVKVVL
TGEGSDEQWA GYRRYQAELI ARMAQYLPGR SAIAALIRQL PRNRRLKQAI RTLDQRDPGR
RYAAWLTIAD VQQRQRLLQP HISAVLGDYA PEQIYDQVYP RSGAAMTNMG LADLQTWLPD
TYLEKVDKAS MAASIEARVP FLDHTLVEWT MNLPPTLKLR GTKTKWLLRQ AFGEMLPQRT
LRKPKHGFAV PTDPWFRGAL SNWTAEILFD QRTLSRGLFN PHEVQRIYQA HRDGKETADT
VLWLLLNIEL WQRIYLDREE RL
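
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch, not part of the original record, of how the blocks might be cleaned and how a GC percentage could be computed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a space- and newline-separated sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used as sample input.
first_line = "ATGTGTGGCA TTACGGGCCA TCTCGAATGG CATGGCACGG CGCAAACCGC CATTCTTGAG"
seq = clean_sequence(first_line)
```

Applying `clean_sequence` to the full gene sequence block and passing the result to `gc_percent` gives a value that can be compared with the GC content listed in the record.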