Gene SeAg_B2071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B2071 
Symbol 
ID6795300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp2001320 
End bp2002756 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content44% 
IMG OID642776293 
Productprepilin-type N- cleavage/methylation domain protein 
Protein accessionYP_002146921 
Protein GI197249826 
COG category 
COG ID 
TIGRFAM ID[TIGR02532] prepilin-type N-terminal cleavage/methylation domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.00288729 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAAGA AAAAAGGATT TACCTTACTG GAAGTCAGTA TTGTTTTAGG TATTGGAACA 
CTCATCGCTT TTATGAAGTT TCAGGATATG AGAAACAATC AGGAAGCGGT ACTGGCTGAT
AATGTCGGAA CACAGATTAA ACAGCTTGGT GAAGCGGTTA ACCGCTATAT CAGTATTCGC
TATGACAAGA TTTCCACACT GTCATCTTCT CACAATCAGA GCAGTGATCC GGGTCCAAGA
ACCTGTACGG CTGCTGGTTG TGAAATCACT TACCAGACGT TGATTAATGA AGGTCTTTTA
CCTGTCGGCT ATACTGGAAC CAACGCGCAA AAATCCACGT ATAAGATTTT GTTGAAGCGT
TCGGGAACAG CGCCTGATTA TGTGATTAAT GGTCTGATAA CAACCAGCAG CCCTTGGAAA
GAAGGTGGGC GCATTCGCTA TGACCTATTA GGCAAAGCAG TTCAGGCTGC GGGTGTTGAT
AGTGGTATGA GCAGAACAAC CAAGATAGCT TCTGGCTATG GTGGTCAGTG GAGTGAGAAT
TCAGGCAACT ACAGCAATAT TACTGATGGT GGATTGCTGG CTTACCGCGT GGGCTATAAC
TCTTCTATGT ATTCTGTTTA CTTGCGTCGT GATGGAACAT TGCCTATGAC TGGCGATCTA
AATCTTGGCG GTAAGAGCAT CAAAAACATC AAAGATATGA CGGCATCAGG AACAACAACG
ACCGGAACAC TTAATACAAC GGGTAAAGCA TCTGTTGCAT CAGATCTTGC TGTTGGTGGA
ACCTCAACCC TGAACGGGCA GGTAAATATC AATAATAACC TGAAAGTGAA ATCCGATACT
TATCTGAACA CGCTATCTAC AACAGGGCTG GCTAAGTTTG GTAGTCGTAT CGCGACCAAT
GGACTCAATC CCAACGATCT ACCATCTGGT TGGGCTGGAG GTGTTCGCAC CTATGACTTG
TATGCATCGG GGACTGTTGG CGCTGGAACC GGGAAAACGG TTAATGCTTA TATGAATAGT
GCAGGTAATA TTTTTGCGTC AGGCAATATT ACAGCCGGAA CGATAAAATC CAATGGTACA
ATTGAATCAG TTGGTAGAGT TAAAGTGGGT GAATATCTCC ATCTGAATGG TCAGGCAACG
CTGAATGCGA AATGTACGCC TAATGGTCTG GTGGGGCGAG ACAGTGCGGG TAAGGTTTTA
TCCTGCGTCA GTGGCAAGTG GTCCGAACTA ACACCTGCAG CAGCGTCAGG CATCTATGTT
CGCTACTACA ATAGCATTAA ATGGTCCTGT ACTGTTCCTA ACAGAGCGAC AGGTGGTTGT
TCTTGTTCAT CAGGTCTAAA CAGTATTTTG ATAAATACGG ATTCACAACA GTCATGTAGT
GGTGGTAAGA ACGATCACTG TAGGACATAT ACCAAATATT TTTATGCTTG TGTTTAA
 
Protein sequence
MIKKKGFTLL EVSIVLGIGT LIAFMKFQDM RNNQEAVLAD NVGTQIKQLG EAVNRYISIR 
YDKISTLSSS HNQSSDPGPR TCTAAGCEIT YQTLINEGLL PVGYTGTNAQ KSTYKILLKR
SGTAPDYVIN GLITTSSPWK EGGRIRYDLL GKAVQAAGVD SGMSRTTKIA SGYGGQWSEN
SGNYSNITDG GLLAYRVGYN SSMYSVYLRR DGTLPMTGDL NLGGKSIKNI KDMTASGTTT
TGTLNTTGKA SVASDLAVGG TSTLNGQVNI NNNLKVKSDT YLNTLSTTGL AKFGSRIATN
GLNPNDLPSG WAGGVRTYDL YASGTVGAGT GKTVNAYMNS AGNIFASGNI TAGTIKSNGT
IESVGRVKVG EYLHLNGQAT LNAKCTPNGL VGRDSAGKVL SCVSGKWSEL TPAAASGIYV
RYYNSIKWSC TVPNRATGGC SCSSGLNSIL INTDSQQSCS GGKNDHCRTY TKYFYACV