Gene Haur_4242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4242 
Symbol 
ID5736096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5410258 
End bp5411388 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content53% 
IMG OID641281397 
Productcarbamoyl-phosphate synthase, small subunit 
Protein accessionYP_001547002 
Protein GI159900755 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0505] Carbamoylphosphate synthase small subunit 
TIGRFAM ID[TIGR01368] carbamoyl-phosphate synthase, small subunit 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACGCG CATTACTGGC ATTGGAAGAT GGACGCACCT TTTGGGGGCG GGCTGTCGGT 
GCACGCGGCG AACGTGCAGG TGAAGTTGTA TTCAATACCA GCATGACTGG CTACGCTGAA
ATTTTGACCG ATCCTTCATA TCGCGGTCAG TTAGTAACCT TAACCGCCTC GCATATTGGC
AATTATGGCA TTGACGAGGT TGATCTTGAG GCAGCCATGC CTTGGGCCGA AGCCTTGATC
GTGCGTTCAT TTACCGAGCG GCCATCCAAT TGGCGTTCGC GCGAATCTCT CAGCGAGCTA
TTAGCACGGC GCGGTGTTAT GGCGGTGGCT GATTTAGATA CCCGAGCCTT GACTCGCCAT
ATTCGGGCAG CCGGCGCAAT GCGGGCGGTG CTCTCAACCG AAGATCTTGA TCCGGCTAGT
TTGGTCGCCA AAGCTCAAGC CATCCCGGTG ATGGAAGGCC GCGATTTGGC TAGCGACGTG
GGAACCCAAA GCATTTATGA GTGGAATGAA GGCACACCTG CCGATTTCAC TACCTTACAA
CTAGCGATTC CTGAACAATT GCACAACCGC CATGTCGTAG TTTACGATTT TGGGGTCAAA
CGCAACACCC TGCGGCGCTT GGTCGATCTT GGTTGTAAGG TCACAGTTGT GCCCAATCGC
ACTTCAGCCG AAGCGACCCT AGCCTTAAAA CCTGATGGTA TTTTGATTTC AAATGGCCCA
GGCGACCCTG CCACCTTGGA GTATGCGGTC GAAACGATTC GCCAGTTGAT CGGCAATGTG
CCAGTGTTTG GCATCTGCCT TGGGCATCAA TTGATTGGCC AAGCCTTGGG TGGTACAACC
TTCAAATTGC CTTTTGGTCA TCATGCTGGC AATCACCCAG TGTGCGATAC CAGCACTGGC
AAAGTCCGAA TCACTGCTCA AAATCATGGC TTTGCCCTCG ATCCAGCCAG CTTGCCCAGC
GATGTGCAGG TGACCGAAGT TAGTGGTAAC GACCAAACCT GTGAAGGCTT GCAACACAAG
AGCTTGCCTG TTTTCAGCGT GCAATATCAC CCTGAGGCTG GGCCTGGCCC TCACGATGGA
GATGAACACT TCCGGCGTTT TATCAGCCTC GTTGATCAAC AACGTAGCTA A
 
Protein sequence
MTRALLALED GRTFWGRAVG ARGERAGEVV FNTSMTGYAE ILTDPSYRGQ LVTLTASHIG 
NYGIDEVDLE AAMPWAEALI VRSFTERPSN WRSRESLSEL LARRGVMAVA DLDTRALTRH
IRAAGAMRAV LSTEDLDPAS LVAKAQAIPV MEGRDLASDV GTQSIYEWNE GTPADFTTLQ
LAIPEQLHNR HVVVYDFGVK RNTLRRLVDL GCKVTVVPNR TSAEATLALK PDGILISNGP
GDPATLEYAV ETIRQLIGNV PVFGICLGHQ LIGQALGGTT FKLPFGHHAG NHPVCDTSTG
KVRITAQNHG FALDPASLPS DVQVTEVSGN DQTCEGLQHK SLPVFSVQYH PEAGPGPHDG
DEHFRRFISL VDQQRS