Gene Haur_0060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0060 
Symbol 
ID5731932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp75032 
End bp78229 
Gene Length3198 bp 
Protein Length1065 aa 
Translation table11 
GC content54% 
IMG OID641277181 
Productcarbamoyl-phosphate synthase, large subunit 
Protein accessionYP_001542840 
Protein GI159896593 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAAAC GCAGCGACAT TCACTCGATT CTGGTGCTTG GCGCAGGCCC AATCGTGATC 
GGGCAAGCAT GCGAATTTGA TTATAGCGGC ACTCAGGCGA TCAAAGCGCT TCGCGAAGAG
GGTTTCCGCA TTATTTTGGT CAATTCCAAC CCAGCCACGA TTATGACCGA CCCGTCGTTG
GCTGATCGTA CTTATATTGA GCCGTTGGAT GTGCCAACGG TCAGCGAAAT TATTGCCCGC
GAACGGCCTG ATGCCTTGTT ACCCACGGTC GGCGGCCAAA CCGCGCTCAA TCTGGCCATG
GATTTGCACC GCGCTGGCGT ATTGGAGCAA TATAATGTCC AGTTGCTCGG CGCAAATGTC
GAAGCCATCG CTACTGCCGA GGATCGCGAG TTGTTCAAAC AGGCCATGGA TCGCATTGGC
TTGCAATCGG CGCGTTCAGG CATCGCTCGC TCGCTCGAAG AAGCTCTCGA ATTAGTCAAA
GCGACGGGCT TTCCTTCGAT TATTCGGCCT TCGTTTACTC TCGGCGGCGC TGGCGGCGGG
ATTGCCTACA ACTCTGAGGA TTTTCGGCGA ATTGTCGAAG AGGGCTTGAA ACTTTCCCCG
ATCGGCGAAC TATTGATCGA AGAAAGTTTG CTGGGCTGGA AGGAATATGA GCTTGAGTTG
ATGCGTGATA GCGCTGGCAA TGGTGTGGTG GTCTGTTCAA TTGAGAATTT CGACCCCATG
GGCGTGCACA CTGGCGACTC GATCACCGTT GCTCCGGCGA TGACCCTCTC CGACCGCGAA
TATCAACGTT TGCGCGATAT GGGCTTGGCG GTGATGGAAG TTGTTGGTGT GGCAACTGGC
GGATCGAATG TCCAATTCGC GATCAATCCC CACGATGGCC GCGTGATCGT GATCGAGATG
AATCCACGGG TTTCGCGTTC ATCGGCGCTA GCATCCAAAG CAACCGGCTT TCCAATTGCT
AAAATCGCCG CGAAGTTGGC GGTTGGCTAC ACCCTGCCCG AATTGCCCAA CGATATTACC
AAATCAACCC CAGCCTCATT CGAGCCATCG CTCGATTATG TGGTGGTCAA AATTCCACGC
TGGAACTTCG AGAAATTCCC TGGTTCGCAG CCAGTGCTTG GCACAGCGAT GAAATCAGTC
GGCGAGGCCA TGGCCATGGG CCGCACCTTC ACCGAAGCCT TGCAAAAAGC CTTGCGCTCG
TTGGAAAATG GCCGCATGGG TTGGGGCGCT GATGGCAGCC AACCTGTGGC CCAAGAACGC
TTGCGCGAAC TGTTGATCAC GCCAACGCCA CAGCGCATTT TTGCCATGCG CACCGCCTTC
GACCTTGGCT GGACGGTTGA TCAAATTCAT GAATTGACCA AAATCGATTA TTGGTTCCTT
GATCAGTTGG CTAGTTTGAT TGCGCTCGAA AAAGATGTGC GCAACTATGA TTTGGCGACG
ATTCCCGCCG ATCTATTGCT GCAAGCCAAA CGCAACGGCT ATAGCGACCC GCAATTGGCC
TATTTGCTGA ATGCAACCCC AGCAGAGGTA CGCCAACAAC GGTTTGAGCA CAATATCAAA
CCAACCTATC ATCATGTTGA TACCTGTGCG GCGGAATTTC CAGCCCAAAC GCCCTATCTC
TACTCATCAT ATGAAACCGA GAGCGAAGCC AATCCCAGCG ATCGGCGCAA AGTGATGATT
TTGGGTGGCG GGCCAAATCG CATCGGCCAA GGGCTTGAAT TTGATTATTG CTGCTGTCAC
GCGGTGTTTG CCCTGCGCGA GCTTGGCTAT GAAACGATTA TGGTCAACTG TAACCCTGAA
ACGGTTTCAA CCGATTATGA TACCGCTGAT CGACTGTATT TTGAGCCATT GACTCTCGAA
GATGTGCTGA ATATTGTCGA TGTCGAGCAA CCAGCTGGCG TGTTGTTGCA GTTTGGTGGT
CAAACCCCAC TCAAATTGGC CAAAGGCTTA GAAGCCGCAG GCGTGCCATT ATGGGGAACC
TCGCCAGCCG CAATCGACCG TGCCGAAGAT CGCGGCCAAT TTGGCGCAGT ATTGGCCGAA
TGTGGCTTGC GGGCAACGCC GTATGGATCG GCAACCTCGT GGGATGAAGC GCGGGCAATC
GCTGAGCGCA TCGGCTATCC AACGATGGTG CGGCCATCGT ATGTGCTGGG CGGCCAAGGT
ATGGCAATCA TCCACGATCA AGCAGGGCTT GACGATTATT TGCGCCATAT TACTGAGGCC
TCGCCCGAAC ACCCTGTGCT GCTCGACCGC TTCCTTGAAG ATGCGACTGA GCTTGATGTT
GATGCGGTTG CTGATGGCGA AACCGTAGTA ATTGCTGGCA TGATGGAGCA AATCGAACGC
GCAGGCATCC ACTCCGGCGA TTCGGCTTGT GTTATGCCAA CCGTCGGCAT CAAGCCTCAG
GTGATTGAAC AACTCAAAAT CGCCACTGAT AAATTAGCCC GCGCTTTGGG CGTGATTGGC
TTGATGAACG TTCAATTCGC GGTCAAGGAT GACGAAATCT ACGTGCTAGA AGTCAATCCA
CGGGCTTCGC GTTCGATTCC CTATGTTGCC AAAGCCAGTG GGGTCGCTTG GGCCGCCCTT
GCCGCCAAAG TAATGGCAGG CGTGAGCCTC AAACAACAAG GTATCAGCAG TGCGCCGGAG
TTACGCGGCT TCCACGTCAA AGAAGTTGTG TTGCCATGGC GGCGCTTCCC GGGTGCGACG
ATTGCCCTAG GCCCAGAAAT GCGCTCAACT GGCGAGGCCA TGGGTAGCGG CGATAGCTTT
GGGGCGGCCT TTGCCAAAGC CCAAGCAGGC TGTGGCCGCA GCTTGCCAAC CAGCGGCGCA
ATCTTTGTCA GTGTCAACGA CCACGATAAA CCAGCCTTAG TGCCAGTTGC CAAGCAATAT
AGCGAACTCG GCTTCAAAGT GATTGCGACT GCTGGCACAG CCCAATATCT GCGTGAGCAT
GGCTTGGTCG TCGAAACGAT CTACAAAGTT AATGAAGGCC GACCCAACGC CGCCGACTAT
ATTATCAACG GCCAAGTTGA TATTATTGTC AACACGCCAT TGGGCCGTGC CTCGTTATTT
GATGAGCAGG CAATTCGCCA AACTGCTTTG CGCCAAGGCG TGATTTCGAT CACCACCGTG
GCCAGTGCTG CGGCGGTTGC CGATGCGATT CAAGCGCTGC GCCAAGGTGG CTTGGGCGTG
CGAGCTTTGC AAGATTAA
 
Protein sequence
MPKRSDIHSI LVLGAGPIVI GQACEFDYSG TQAIKALREE GFRIILVNSN PATIMTDPSL 
ADRTYIEPLD VPTVSEIIAR ERPDALLPTV GGQTALNLAM DLHRAGVLEQ YNVQLLGANV
EAIATAEDRE LFKQAMDRIG LQSARSGIAR SLEEALELVK ATGFPSIIRP SFTLGGAGGG
IAYNSEDFRR IVEEGLKLSP IGELLIEESL LGWKEYELEL MRDSAGNGVV VCSIENFDPM
GVHTGDSITV APAMTLSDRE YQRLRDMGLA VMEVVGVATG GSNVQFAINP HDGRVIVIEM
NPRVSRSSAL ASKATGFPIA KIAAKLAVGY TLPELPNDIT KSTPASFEPS LDYVVVKIPR
WNFEKFPGSQ PVLGTAMKSV GEAMAMGRTF TEALQKALRS LENGRMGWGA DGSQPVAQER
LRELLITPTP QRIFAMRTAF DLGWTVDQIH ELTKIDYWFL DQLASLIALE KDVRNYDLAT
IPADLLLQAK RNGYSDPQLA YLLNATPAEV RQQRFEHNIK PTYHHVDTCA AEFPAQTPYL
YSSYETESEA NPSDRRKVMI LGGGPNRIGQ GLEFDYCCCH AVFALRELGY ETIMVNCNPE
TVSTDYDTAD RLYFEPLTLE DVLNIVDVEQ PAGVLLQFGG QTPLKLAKGL EAAGVPLWGT
SPAAIDRAED RGQFGAVLAE CGLRATPYGS ATSWDEARAI AERIGYPTMV RPSYVLGGQG
MAIIHDQAGL DDYLRHITEA SPEHPVLLDR FLEDATELDV DAVADGETVV IAGMMEQIER
AGIHSGDSAC VMPTVGIKPQ VIEQLKIATD KLARALGVIG LMNVQFAVKD DEIYVLEVNP
RASRSIPYVA KASGVAWAAL AAKVMAGVSL KQQGISSAPE LRGFHVKEVV LPWRRFPGAT
IALGPEMRST GEAMGSGDSF GAAFAKAQAG CGRSLPTSGA IFVSVNDHDK PALVPVAKQY
SELGFKVIAT AGTAQYLREH GLVVETIYKV NEGRPNAADY IINGQVDIIV NTPLGRASLF
DEQAIRQTAL RQGVISITTV ASAAAVADAI QALRQGGLGV RALQD