Gene TM1040_1143 details

Gene Information

Locus tag: TM1040_1143
Symbol:
ID: 4078439
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1229411
End bp: 1230922
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 63%
IMG OID: 638006447
Product: anthranilate synthase component I
Protein accession: YP_613138
Protein GI: 99080984
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
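
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the helper names span_length and expected_protein_length are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1229411
end_bp = 1230922
gene_length_bp = 1512
protein_length_aa = 503


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print(computed_length == gene_length_bp)      # coordinates agree with Gene Length
print(computed_protein == protein_length_aa)  # Gene Length agrees with Protein Length
```

Under these assumptions, 1230922 - 1229411 + 1 gives 1512 bp, and 1512 / 3 - 1 gives 503 residues, so the listed fields are mutually consistent.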


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0793229
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCCCTGA TCCCCGATTT CGACAGTTTC GCCAAAGCCT ATGAGGCGGG CGAAAACCAG 
GTGGTCTACA CACGGCTTGC CGCCGATCTG GATACGCCCG TCTCCTTGAT GCTGAAGCTC
ACCGGCGCGC AGAAGGATGC CTTCATGCTG GAATCGGTGA CCGGCGGCGA GGTCCGCGGG
CGCTATTCCA TCATCGGCAT GAAGCCTGAC CTGATCTGGC GTGCGCGGGG CGAACAGGCC
GAAATCAACC GCGCCGCGCG GTTCGATCCC GAGGGCTTTG CCCCGCTCGA CGGCAACCCA
CTTGATACAC TGCGCGCGCT ACTGGCCGAG AGCCGCATCG ACCTGCCGGA TGATCTGCCA
CAGGCGGCGG CGGGGCTGTT TGGCTATCTG GGCTATGACA TGATCCGCCA TGTGGAGCAT
CTGCCGGATG TGAACCCCGA CCCGCTTGGC CTGCCGGACG CGGTGATGAT CCGCCCCTCC
GTGGTGGCCG TGCTGGACGG TGTCAAAGGC GAGGTCACAG TGGTCTCTCC CGCCTGGGTC
AGCGAAGGCC AGTCGGCGCG CGCGGCCTAT GCTCAGGCTG CCGAACGCGT GATGGATGCG
GTGCGTGATC TTGAACGTGC CATGCCCGCC GAGACCCGCG ATCTGGGCGA GGCGCGCGAG
GTCGCGCCCC CGGTCTCCAA CTTCACCAAG GACGGCTACA TGGCCGCCGT GGAGAAGGCC
AAGGACTACA TCCGTGCCGG CGACATCTTT CAGGTAGTGC CCGCACAGCG CTGGACGCAG
GAGTTCCCGC AGCCGCCCTT CGCGCTCTAT CGTTCGCTGC GACGCACCAA CCCCTCGCCG
TTCATGTTCT ACTTCAACTT CGGCGGTTTT CAGGTGATCG GCGCCAGCCC CGAGATCCTC
GTTCGGGTCT TTGGCAACGA GGTCACCATT CGCCCCATTG CTGGCACCCG TCCGCGCGGC
GCAACCCCCG AAGAAGACAA AGCGCTGGAA CAGGATCTGC TTGCCGACAA GAAAGAGTTG
GCCGAGCACC TGATGCTCTT GGATCTGGGC CGTAACGACG TGGGCCGCGT TGCCAAGATC
GGCACCGTGA AACCCACCGA GGAATTCATC ATCGAGCGCT ACAGCCACGT GATGCATATC
GTTTCGAATG TTGTTGGCGA ACTCCACGAG GACAAAGACG CGCTCGATGC ATTTTTTGCA
GGCATGCCTG CGGGTACGGT TTCCGGAGCG CCCAAGGTGC GTGCGATGGA GATCATCGAC
GAGCTCGAAC CCGAAAAGCG CGGCATCTAT GGCGGTGGCG TCGGCTATTT CAGCGCTGGC
GGCGACATGG ACATGTGTAT CGCGCTGCGC ACAGCCATCG TGAAGGATCA GAACCTCTAT
ATTCAGGCCG GGGGCGGCGT CGTCTATGAC AGCGACCCGG AGGCCGAATA TATGGAGACC
GTGCATAAAT CGAACGCGAT CCGCCGTGCG GCTGCAGATG CGGCGCGCTT TACCGGCAAC
GGCAACCGCT GA
 
Protein sequence
MALIPDFDSF AKAYEAGENQ VVYTRLAADL DTPVSLMLKL TGAQKDAFML ESVTGGEVRG 
RYSIIGMKPD LIWRARGEQA EINRAARFDP EGFAPLDGNP LDTLRALLAE SRIDLPDDLP
QAAAGLFGYL GYDMIRHVEH LPDVNPDPLG LPDAVMIRPS VVAVLDGVKG EVTVVSPAWV
SEGQSARAAY AQAAERVMDA VRDLERAMPA ETRDLGEARE VAPPVSNFTK DGYMAAVEKA
KDYIRAGDIF QVVPAQRWTQ EFPQPPFALY RSLRRTNPSP FMFYFNFGGF QVIGASPEIL
VRVFGNEVTI RPIAGTRPRG ATPEEDKALE QDLLADKKEL AEHLMLLDLG RNDVGRVAKI
GTVKPTEEFI IERYSHVMHI VSNVVGELHE DKDALDAFFA GMPAGTVSGA PKVRAMEIID
ELEPEKRGIY GGGVGYFSAG GDMDMCIALR TAIVKDQNLY IQAGGGVVYD SDPEAEYMET
VHKSNAIRRA AADAARFTGN GNR
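
The relationship between the two sequences above can be illustrated with a small translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard code for internal codons, and it is not the tool the database used. The names CODON_TABLE, clean, translate and gc_fraction are hypothetical helpers introduced here; only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 differs from the standard code only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGCCCTGA TCCCCGATTT CGACAGTTTC GCCAAAGCCT ATGAGGCGGG CGAAAACCAG"
last_line = "GGCAACCGCT GA"

print(translate(first_line))  # MALIPDFDSFAKAYEAGENQ
print(translate(last_line))   # GNR
```

The first 60 bases translate to the first 20 residues of the protein sequence, and the final 12 bases give the C-terminal GNR followed by a TGA stop codon. Applying gc_fraction to the full gene sequence would allow the listed GC content to be checked in the same way.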