Gene Cmaq_1043 details

Gene Information

Locus tag: Cmaq_1043
Symbol:
ID: 5710179
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1093572
End bp: 1094825
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 46%
IMG OID: 641275543
Product: anthranilate synthase
Protein accession: YP_001540862
Protein GI: 159041610
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID:
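
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but that reading reproduces the listed gene length) and that the CDS ends in a single stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1093572
END_BP = 1094825
GENE_LENGTH_BP = 1254
PROTEIN_LENGTH_AA = 417


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one for the stop codon (unspliced CDS assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1254
print(expected_protein_length(GENE_LENGTH_BP))  # 417
```

Both results agree with the Gene Length and Protein Length fields, which supports the inclusive-coordinate reading.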


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.00000237624
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAAGGTGC CCCTTGAGGA GTTACCTAAG CCTAGGGAAT TAGTTAGGGT TCTTATTGAG 
AATGGTGAGG ATCACGTTAC ACTACTTGAG AGTGGGCCAG GATTCCCTGA GAGGGCTAGG
TTCACCATAG TGGCCTGGGG GGTTAAGGAC TTAGTGACTA TTAATGATAA CTTATATGAT
GAGCTTAAGT CCCTTTACAG GGGACTGGGT AGGTTTGAGG ATGGTAATAT CGCTGTGGGT
TACTTATCCT ATGAGGCCAT TGCATCAATA GAGCCTCACT TAGCTGGTTT AATTAAGATG
AGTGATTGGC CTCAGGCTGA ATTCATGATA CCCAGTAACG TGGTTATCTA CGATTACTTC
CTGGGGAGGG CCTACGTTAA GGGTGAGTTA CCTAGGAGTA AGCCTGGTGA TGAGGGTGAT
TTCAAGGTCA CTGGGTTAGT GACCGCCACG GATCCCGTTG AGTACATGAA GTGGGTTTCA
GAGGCCATTG AGGATATTAA GAATGGTGAG GTCTTCCAAG TGGTTTTATC AAGGTATGAG
GAGTATGGGT TCACGGGGGA TTTAATGACT CTTTACGGGA AGTTAGCTGA CCTTAACCCA
TCACCATACA TGTACTTCAT GAGGATGAGT GATAAGTACA TAATAGGCAC AAGCCCTGAA
CTACTGGTTA AGGTTGATGG ACTCAGGGTT GAGACACACC CAATAGCTGG AACAAGACCT
AGGGGGAGGG ATTCATGGAG TGATATTAGG CTTGAGGAGG AATTACTCTC AAGCATTAAG
GATAGGGCTG AACACGTAAT GCTCGTTGAC TTAGCTAGGA ACGATATAGG TAAGGTATGC
GTATACGGTA GTGTTAAGGT TAAGGAACTA TACGCCATTG AGAAGTATCA AAGCGTCCAA
CACCTTGTTT CCAGGGTTGA GGGAATGCTG AGTAAGGGTA ATGATATTGT CGACGCCTTA
GTCTCAACAT TCCCAGCAGG CACAGTGAGT GGTGCACCTA AGCCAAGGGC AATGGAGCTA
ATAGCCAAGT ACGAGAATTC ACCAAGAGGA CCCTACGCAG GTGCATTGGG CATTATGCAT
AGTGGTGGTG GGGAATTCGC CATAATAATA AGAAGCCTAT TCTCAATGAA CGATAAAGTA
AGGATTCAGG CAGGCGCCGG CATAGTTTAC GATTCAATAC CTGAAGCCGA GCTTCAGGAA
ACTGAGGATA AGCTTGGAAG CATTAAGAGG GTGTTAGGTG TATGGCGGAC ATAA
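
The GC content field of this record can be recomputed from the gene sequence. The following is a minimal sketch; `gc_content` is a hypothetical helper, not part of the source database, and the record does not state whether its 46% figure is rounded or truncated. Whitespace is stripped so the sequence can be pasted in its 10-base block layout.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Spaces and line breaks are ignored; case is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# First 10-base block of the gene sequence: 5 G + 1 C out of 10 bases.
print(gc_content("GTGAAGGTGC"))  # 60.0
```

Passing the full gene sequence gives the whole-gene percentage to compare with the GC content field.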
 
Protein sequence
MKVPLEELPK PRELVRVLIE NGEDHVTLLE SGPGFPERAR FTIVAWGVKD LVTINDNLYD 
ELKSLYRGLG RFEDGNIAVG YLSYEAIASI EPHLAGLIKM SDWPQAEFMI PSNVVIYDYF
LGRAYVKGEL PRSKPGDEGD FKVTGLVTAT DPVEYMKWVS EAIEDIKNGE VFQVVLSRYE
EYGFTGDLMT LYGKLADLNP SPYMYFMRMS DKYIIGTSPE LLVKVDGLRV ETHPIAGTRP
RGRDSWSDIR LEEELLSSIK DRAEHVMLVD LARNDIGKVC VYGSVKVKEL YAIEKYQSVQ
HLVSRVEGML SKGNDIVDAL VSTFPAGTVS GAPKPRAMEL IAKYENSPRG PYAGALGIMH
SGGGEFAIII RSLFSMNDKV RIQAGAGIVY DSIPEAELQE TEDKLGSIKR VLGVWRT
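
The protein sequence follows from the gene sequence under translation table 11, which uses the standard amino-acid assignments but accepts additional start codons. That is why the initial GTG is read as M rather than V. The sketch below is a self-contained illustration, not the database's own pipeline. `translate_cds` is a hypothetical name, and the function assumes an unspliced, in-frame CDS containing only A, C, G and T.

```python
from itertools import product

# NCBI codon ordering: first base varies slowest, bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the first codon as M and stopping at the first stop."""
    dna = "".join(seq.split()).upper()
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # any valid start codon is translated as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate_cds("GTGAAGGTGC CCCTTGAGGA GTTACCTAAG"))  # MKVPLEELPK
```

The output matches the first ten residues of the protein sequence. The final codons of the gene (ACA TAA) give the closing T followed by the stop.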