Gene Shewana3_2022 details


Gene Information

Locus tag: Shewana3_2022
Symbol:
ID: 4476387
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. ANA-3
Kingdom: Bacteria
Replicon accession: NC_008577
Strand:
Start bp: 2417021
End bp: 2418244
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 38%
IMG OID: 639726605
Product: phage integrase family protein
Protein accession: YP_869659
Protein GI: 117920467
COG category: [L] Replication, recombination and repair
COG ID: [COG0582] Integrase
TIGRFAM ID:
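
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the reported gene length is consistent with it) and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2417021
end_bp = 2418244
gene_length_bp = 1224
protein_length_aa = 407

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon that is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, expected_protein_aa)  # 1224 407
```

Under those assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.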


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0581728
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000356379
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGGTAACA TTAGAGCGCG CGATGGCAAA CTATTTTTCG ATTTCACTTA CGAAGGGATC 
AGATGTCGAG AGCAAACGGT TTTGATTGAT AATCGATTAA ACCGATCAAA ATTGACCAAT
ATTTTGAATA CTATCGAAGC TGAAATTCGA CTGAATCGAT TTGTGTTTAA GTCCTATTTT
CCTTCGAGCT CAAGATGTAG CCAATTTTCA GAATATGACA TCAGAGTGCA ACAGCATTTG
GCTGAAAATG TTGAACCTTT GACTGTAAAT ATTATGTCAA ATATTCCTAC ATTCGAAGAG
TTTGTCGGCG AATGGGTTGT AGAAAACAAG ATTCAATGGA AAAAGAGTCA TTGTAAGAAC
ATACAGATGA TCATTGAATG TTACTTATTA CCTGCATTTG GTCATATTAG ATTAAATGAA
ATCACAAGGC CAAGTTTGAT TAAGTTTAGA GCTTCTATTG CTGACTCTAG ACGTACAGGC
ACTACAGAAA AACTAAGTAA CGACTGGATT AATCATGTAA TGACGCCACT ACGAGGGATC
TTAAATGAAG CAGCATTACG ATTTGACTTC CCTACCCCGT TTGTCAATAT CAAGCCTTTA
AGAATCGATA AGACGGTCAT AGAACCCTTT TCTCTAGCGG AAGTTCAATA TTTTTTAGAG
CACATTCGAG ATGATTTTAG AAATTATTAT ACCGTTAGAT TTTTCACTGC CATGCGAACA
GCAGAAATCG ATGGATTAAA ATGGCAATTT GTTAATTTAG ATCGTTATGA AATCTTAATC
CAAGAAACAC TTGTTGACGG TTACGTTGAA ACCCCTAAAA CATCAGCCTC TTATCGTAGT
ATTCAATTAT CACAACCTGT AGTTGATGCA TTAAAAAGAC AGCGAAAAGT AACTGGCAAT
AAGACTTATG TATTTTGTAA TGCGAGTGGC AACCCATTAG AGCACCGAAA TGTAACGAAA
CGAATTTGGT ACCCAGCTTT AGACGAAATG TGCTTAAAAA GAAGACGCCC ATACCAAACA
AGGCACACTT GCGCAACGTT ATGGCTTGCA GCAGGTGAAA ACCCTGAATG GATAGCCAAA
CAAATGGGAC ACAGCACAAC AAAAATGTTG TTTGAAGTTT ATAGCCGTTA TGTTCCAAAC
GCGACTAGGC AAGACGGTAG TGCATTCGAT CGACTCATTC AGCATGTTAA TTTTGACTTT
AACCGGGAGA GTTCAGATGA GTAA
 
Protein sequence
MGNIRARDGK LFFDFTYEGI RCREQTVLID NRLNRSKLTN ILNTIEAEIR LNRFVFKSYF 
PSSSRCSQFS EYDIRVQQHL AENVEPLTVN IMSNIPTFEE FVGEWVVENK IQWKKSHCKN
IQMIIECYLL PAFGHIRLNE ITRPSLIKFR ASIADSRRTG TTEKLSNDWI NHVMTPLRGI
LNEAALRFDF PTPFVNIKPL RIDKTVIEPF SLAEVQYFLE HIRDDFRNYY TVRFFTAMRT
AEIDGLKWQF VNLDRYEILI QETLVDGYVE TPKTSASYRS IQLSQPVVDA LKRQRKVTGN
KTYVFCNASG NPLEHRNVTK RIWYPALDEM CLKRRRPYQT RHTCATLWLA AGENPEWIAK
QMGHSTTKML FEVYSRYVPN ATRQDGSAFD RLIQHVNFDF NRESSDE
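
The relationship between the gene sequence and the protein sequence above can be illustrated by translating the first and last stretches of the gene. This is a minimal sketch, not the pipeline used to produce the record. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two differ in which start codons are permitted, and that is ignored here). The `translate` helper and the variable names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 differs from table 1 only in permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 24 bases of the gene sequence listed above.
first_30_bases = "ATGGGTAACA TTAGAGCGCG CGATGGCAAA"
last_24_bases = "AACCGGGAGA GTTCAGATGA GTAA"
print(translate(first_30_bases))  # MGNIRARDGK
print(translate(last_24_bases))   # NRESSDE
```

The two outputs match the first ten and last seven residues of the listed protein sequence, and the final TAA codon is the stop codon that accounts for the 1224 bp gene encoding 407 rather than 408 residues.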