Gene Shewana3_3441 details

Gene Information

Locus tag: Shewana3_3441
Symbol:
ID: 4476784
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. ANA-3
Kingdom: Bacteria
Replicon accession: NC_008577
Strand:
Start bp: 4123590
End bp: 4124747
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 52%
IMG OID: 639728050
Product: hypothetical protein
Protein accession: YP_871070
Protein GI: 117921878
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG2039] Pyrrolidone-carboxylate peptidase (N-terminal pyroglutamyl peptidase)
TIGRFAM ID:
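
The coordinates and lengths in this record are consistent with one another, and a short check makes the arithmetic explicit. The sketch below assumes 1-based, inclusive coordinates (the convention that reproduces the listed gene length) and a coding sequence that ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4123590, 4124747)
print(length, protein_length_aa(length))
```

For this record the check gives 1158 bp and 385 aa, matching the listed Gene Length and Protein Length.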


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00415893
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.25947
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAAAGC CAAGCCTCGT TTTTATCCTC GCCAGCACAG TGGCGCAAAC CGCAGGTGCA 
GTACAACTCC TAGGAGATGT GGAAGTTTCC CGCATTCCCA CTGCAGAAAA AACCATGGCA
GAAGTGGTCT ACCGCTATCA AGCCTTGGAC GAAGGCTTGG CGACTCAGCT TTCCGCACAG
AAGAATGAAC GCGATGCAAC CCAACTTGCC GCTCGTCAGG GGCATAGACT GTGGCAACAG
GCGGTGCGTG ATGTGCAGTC AGGGCACTTT GACGACAGAT CCCTCTACTG GGCTCGGCTC
TCAATGTTAA ATAGCATCAA GAGCAATCGC GCCAATTTCA AAATGGCCGA TTGGCAACAG
AATATTTTAG CCAGCGCAGT CGAAAAGGCA TCTCGCGGTT TTAGCGATAT CCAATACGGC
GACGATGTGC AGATAAAAAT CTTCCTGACG GGATTCGACC CTTTCTTCCT CGATAAAGAC
ATCAGCCAGA GCAATCCCTC GGGCTTGGTC GCCCTTGCCC TCGATGGTTT TAGATTTGAT
ATCAACGGCA AAAAAGCCCA AATCGAAACC GCGATGATCC CAGTGCGCTT CGAGGATTTT
GATCAAGGCA TTATCGAGTC GCTACTTAGC CCGATTTACC GCGATCCTAA AACCCAGTTT
GTCTTTACCG TCAGCATGGG CCGCAGTGAC TTTGATATTG AACGCTTCCC CGGCCGTAAC
CGTAGCGCCG CCGCGCCGGA TAACCAAAAT CTGTACACAG GCGGAAGCAA AACCGCGCCT
GTCGCCCCCA AACTCAATGG TAAAGACTTT ATCGGCCCAG AGTTTGTTGA GTTTTCACTG
CCCGTTGCCG CCATGCAGGT CAAAGACGGC CAATGGAAAG TCAACGACAA CCATACAGTG
ACCACACTAG CGCGCGGTGA ATTTAATGCC AGCTCCCTAA GCGAGCTGCA AAATGAAACC
TCGGTCGAAG GTTCTGGTGG TGGCTATCTC TCAAACGAGA TTTCTTATCG CGCCATTGTG
TTACAGCAAA AGTTCAACAG CCCAGCCAAG GTCGGCCATA TCCACACCCC AAGGGTGAAG
GGCTACGACA ACGCCACCGA ACAAGCGATT GTCGAGCAAG TGCGTACTAT GGTGATGCAG
GCCACAGCGA GCCTGTAA
 
Protein sequence
MLKPSLVFIL ASTVAQTAGA VQLLGDVEVS RIPTAEKTMA EVVYRYQALD EGLATQLSAQ 
KNERDATQLA ARQGHRLWQQ AVRDVQSGHF DDRSLYWARL SMLNSIKSNR ANFKMADWQQ
NILASAVEKA SRGFSDIQYG DDVQIKIFLT GFDPFFLDKD ISQSNPSGLV ALALDGFRFD
INGKKAQIET AMIPVRFEDF DQGIIESLLS PIYRDPKTQF VFTVSMGRSD FDIERFPGRN
RSAAAPDNQN LYTGGSKTAP VAPKLNGKDF IGPEFVEFSL PVAAMQVKDG QWKVNDNHTV
TTLARGEFNA SSLSELQNET SVEGSGGGYL SNEISYRAIV LQQKFNSPAK VGHIHTPRVK
GYDNATEQAI VEQVRTMVMQ ATASL
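
The gene and protein listings above can be cross-checked by translating the nucleotide sequence and computing its GC fraction. The following is a minimal sketch in plain Python, not the method used to produce this record. It assumes the sequence is pasted in the 10-base blocks shown above (whitespace is stripped), and it uses the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code. Alternative start codons permitted by table 11 are not given special handling, which does not matter here because the gene starts with ATG. The helper names `clean`, `translate`, and `gc_fraction` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order; translation table 11 uses the same
# codon-to-amino-acid assignments as the standard code ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0; drop trailing stop codons."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    codons = (dna[i:i + 3] for i in range(0, usable, 3))
    # Codons containing characters other than T, C, A, G become 'X'.
    protein = "".join(CODON_TABLE.get(codon, "X") for codon in codons)
    return protein.rstrip("*")


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGTTAAAGC CAAGCCTCGT TTTTATCCTC"
print(translate(first_blocks))
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence, or passing it to `gc_fraction` and comparing with the listed GC content, gives a quick consistency check on the record.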