Gene SeD_A0740 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0740 
Symbol 
ID6874689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp737166 
End bp738299 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content61% 
IMG OID642783941 
Productrare lipoprotein A 
Protein accessionYP_002214627 
Protein GI198246252 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0797] Lipoproteins 
TIGRFAM ID[TIGR00413] rare lipoprotein A 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.012679 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value0.997721 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAAGC AGTTGCCTGT AATCTGCGTC GCGGCAGGGA TAGTACTACT CGCGGCATGT 
ACTAATGACG GCGGTCAGCA GCAAACCACC GTCGCGCCGC AACCTGCGGT ATGTAATGGT
CCGACTGTGG AAATCAGCGG AGCGGAACCG CGTTATGAAC CTCTGAATCC GACCGCAAAC
CAGGATTATC AGCGTGACGG TAAAAGTTAT AAAATCGTTC AGGACCCGTC CCGCTTTAGC
CAGGCTGGCC TGGCCGCTAT TTATGATGCG GAACCCGGCA GTAATTTAAC CGCCTCCGGG
GAGATGTTCG ATCCGATGCA GCTTACCGCC GCCCATCCGA CACTGCCTAT CCCCAGCTAT
GCGCGAATCA CCAACCTGGC CAACGGTCGC ATGATCGTCG TGCGTATTAA CGATCGCGGC
CCCTATGGCA CCGATCGGGT CATCTCCCTG TCCCGCGCAG CGGCGGATCG TCTGAATACC
TCGAATAATA CTAAAGTACG GATTGACCCC ATTATCGTAG CGCCGGATGG TTCGCTCTCC
GGCCCGGGGA TGGCCTGTAC GACGGTGGCG AAACAGACCT ACGCTCTGCC CCCGCGCCCT
GATTTAAGCG GCGGAATGGG AAGCGCCTCT TCAGCGCCTG CGCAACCGCA AGGCGACGTT
CTTCCGGTCA GTAATTCCAC GTTGAAAAGT GACGATACCA CCGGCGCGCC GGTGAGCAGC
AGCGGCTTCC TCGGCGCGCC GACTACGCTG GCGCCTGGCG TGCTGGAGAG TAATGAACCG
ACGCCGGCAC CACAGACAGC GCCGGTTTCC GCGCCGGTGA CGGCTCCAGC GACTGCGACG
CCGGTGAGCG CGCCTGCTGC CGCGGCCCCG GTATCTGCCC CGGTGAGCGC GCCTGCCGCC
GCGGCGAGCG GTCGTTTCGT CGTCCAGGTC GGCGCCGTCA GCGATCAAAC GCGTGCGCAG
CAGTACCAAC AGCGTTTAAG CCAGCAGTTC AGCGTACCAG GGCGCGTAAT ACAAAACGGC
GCGGTCTGGC GTATTCAACT GGGGCCTTTT GCCAGTAAAG CCGAAGCCAG CGCGTTACAG
CAACGTTTGC AAACAGAAGC ACAATTACAG TCCTTTATCG CCAGCGCGCA GTAA
 
Protein sequence
MRKQLPVICV AAGIVLLAAC TNDGGQQQTT VAPQPAVCNG PTVEISGAEP RYEPLNPTAN 
QDYQRDGKSY KIVQDPSRFS QAGLAAIYDA EPGSNLTASG EMFDPMQLTA AHPTLPIPSY
ARITNLANGR MIVVRINDRG PYGTDRVISL SRAAADRLNT SNNTKVRIDP IIVAPDGSLS
GPGMACTTVA KQTYALPPRP DLSGGMGSAS SAPAQPQGDV LPVSNSTLKS DDTTGAPVSS
SGFLGAPTTL APGVLESNEP TPAPQTAPVS APVTAPATAT PVSAPAAAAP VSAPVSAPAA
AASGRFVVQV GAVSDQTRAQ QYQQRLSQQF SVPGRVIQNG AVWRIQLGPF ASKAEASALQ
QRLQTEAQLQ SFIASAQ