Gene SeD_A4795 details


Gene Information

Locus tag: SeD_A4795
Symbol:
ID: 6874487
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 4649137
End bp: 4650276
Gene length: 1140 bp
Protein length: 379 aa
Translation table: 11
GC content: 50%
IMG OID: 642787686
Product: hypothetical protein
Protein accession: YP_002218280
Protein GI: 198243435
COG category:
COG ID:
TIGRFAM ID:
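
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values but are not stated in the record itself.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 4649137
end_bp = 4650276

# Inclusive interval length.
gene_length = end_bp - start_bp + 1

# Number of complete codons, and any leftover bases.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1140 0 379
```

The result matches the listed gene length of 1140 bp and protein length of 379 aa.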


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 70
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATTC GACGCTGGAT TTTAAGCGCT ACGCTTTTGC TGCTGCCTGT TCCTGCGTTC 
GCAGATTTTC AGTATCAGCA GGATAAGGAC GGCGTTTTTT ATACCGCCGA CGACGAACCG
CAAATTCTCT CCCGACTGCC TGACGTTAGC TATTCGCATT TACGGCGTAT TGCCGATTTA
TCTCACCCGC AAGACCCTCG CCCGTTAATA GAAATCAATC CCGACAGCCA TAACTGCGAC
GACAATCATA TTTGTCAGCA CGCTTATCTC AGCGATGGGC GCTTTATCCT GTGGGCAGGC
AAAATCGTCC AGAATACCGG GGATGAGCCT GCCGTTGATG TTGCCAGCTT TCAGTCTTTT
GGCGCCTTTG CCGCCGATAA ACACGGTCTC TATTTTGATG GTAAACGTCG TGATAGCAAT
GCGGGTGAAA AACGTGTGGA TATGGCGACT CTGGCAGAGA CGAAAATCTG GAATCTGCTG
CGGGATAAAA ATAATCTCTA TTATGAAGGC CGCTGGCTGG GGCGGGCCGA TGGGTTTCGC
GTGTTGAGGC TGGATTCCAC TTCGGCAAGG GAGTTTATTG TGACGACGGC GCAACGGGTG
ATTGTGAACG GCATACCCAT TACCGCTGAT GCTAATACGT TTCAAATCAT TCGCTGGATG
CCTGGCGAGG TACTAATTTA TCGTGATAAA ACCGGTAAGC ATGACTATGA GATTGATAAT
TCCAGTCGGT ACTGCGGCTA TTTTAATATT GGCCTGCGTG AGGTGACATG GCTGAAACAT
GAGGCAACCA ACGCCGGGAG CAGTTGTAAA GTGGAAACCC TGCCGGGTGT CGATCCGGAG
TATTTTTTTC GTCTGAACGG GAACACCGGT TGGTATAAGG ATCGTATTTA TCAGGTGAGC
ACGAATGCGT TGGGCGAGGG GGTACTGCGC ATTTTTACGT CGCAGGAAAA ACTTCCGGCG
CTGAAAATAG ATAGAGTTAC CTATAATTAC TACCATCTGG CTTTGTCCGC GGATGGGCAA
TTATATCGCC AGATCTCACG TGATCAATGG CAGCGCTATA ACCCGATATT AACAGAGTGG
ACGACGGTAT CACCAGCGCC CACTGACGTT ATCTCTTTGC TTCCCTCTGA TTACCACTAG
 
Protein sequence
MTIRRWILSA TLLLLPVPAF ADFQYQQDKD GVFYTADDEP QILSRLPDVS YSHLRRIADL 
SHPQDPRPLI EINPDSHNCD DNHICQHAYL SDGRFILWAG KIVQNTGDEP AVDVASFQSF
GAFAADKHGL YFDGKRRDSN AGEKRVDMAT LAETKIWNLL RDKNNLYYEG RWLGRADGFR
VLRLDSTSAR EFIVTTAQRV IVNGIPITAD ANTFQIIRWM PGEVLIYRDK TGKHDYEIDN
SSRYCGYFNI GLREVTWLKH EATNAGSSCK VETLPGVDPE YFFRLNGNTG WYKDRIYQVS
TNALGEGVLR IFTSQEKLPA LKIDRVTYNY YHLALSADGQ LYRQISRDQW QRYNPILTEW
TTVSPAPTDV ISLLPSDYH
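
As a rough illustration of how the gene sequence relates to the protein sequence, the following sketch strips the 10-base block spacing and translates codons into amino acids. The helper names `clean_sequence` and `translate` are hypothetical and not part of any database tool. The amino acid assignments are those of the standard genetic code, which translation table 11 shares; the alternative start codons that table 11 allows are not handled here, which does not matter for this gene because it starts with ATG.

```python
# Codon table in the conventional TCAG ordering (same amino acid
# assignments for the standard code and for translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    dna = clean_sequence(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGACCATTC GACGCTGGAT TTTAAGCGCT"
print(translate(first_blocks))  # MTIRRWILSA
```

The output agrees with the first ten residues of the protein sequence. Applying `translate` to the full gene sequence should give the 379 residues followed by `*` for the terminal TAG codon.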