Gene GM21_3440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3440 
Symbol 
ID8138807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3976150 
End bp3977694 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content63% 
IMG OID644871056 
Productapolipoprotein N-acyltransferase 
Protein accessionYP_003023221 
Protein GI253702032 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0815] Apolipoprotein N-acyltransferase 
TIGRFAM ID[TIGR00546] apolipoprotein N-acyltransferase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value7.79664e-23 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGCTGAACC GCATTACCAG TCACCCCCTT TTTCCGTGGA TCGCCGCTGT TGCCAGCGGC 
ATCCTCTTTT TCTTAGGCTA TGCCGGCTTC GACCAGTTCT ACCTGGAGTG GATCTTCCTG
GTGCCGCTTT TCTGGGCCCT GCGTGACGCG CGCCCCGGGC GCGCCTTTCT CATCGGCTGG
GTCGCCGGTA TCGTAGGGCA CGGCGGCGGG TTTTACTGGA TCATCGAAAT GTTCAAGCAG
TTCGCAGGAG CCCCTCTCCC CTTCGCCCTG GTGGGGCTGG CGCTCTTGGC TGCGGCAAAC
GGCATCGTGG TGGCCGCCTG GGCTTGGGGA ACCAGGGTGA TCGCCGCCCG CGGCTGGCAG
GTGATCTGGG TGGCGCCGGT GGTCTGGACC GCGATGGAGA AGTTCTGGCC CGAAGTTTTC
CCCAACTACC TTGGAGCGAG CCAGTACCGG CTGTCGAATC TGACGCAGAT AGCGGATTTC
GCAGGCGTCC TCGGGGTGAG CTTCCTCGTG GTCTACATCA ACGCGACGCT GTACTGGGTG
ACCGCCTGCT GGTTCGAGGA AAAGCGCCTC CCCTGGCGTG CGCTGTCGGC CTTGGCGCTA
TCGCTTCTGT TCGTGCTGGG CTACGGAGAG ATGCGGCTTA AGGAAGTGGA GCGGCAGGTA
GCAACGGCGC AGACCCTCAA GGTCGGGCTG GTACAGGCGA ATCGGGGTGC CGCGGACCTG
CACATCGACT CCGACACTGT GCTGCAGGAG CACCGGGACA TGTCGCGGCT CCTGGTAGAA
AAGCAAAGGC CGGACCTGGT GGTCTGGCCG GAAGGGGTGC CGGTGAGCCT TTCCTCCCGG
GAAGGGGTGC TCCCAACCGC GGCACTCGGG GACCTGGGCG TCCCGCTTCT CTTCGGCGCC
TGCCTGCGGG TAGCCGACGG GATCTGCAAC AGCGCCTTTC TGGTCGACGC CTCCGGGCGC
ATCCTTGGGA GCTACGACAA GACGGTGCTG GTTCCTTTCG GAGAGTACAT TCCCTTCGGC
GACACCTTCC CCAGCCTCTA CTCCTGGTCT CCCTACTCGA GCCGCTTCTG GCGCGGCCAA
AGCGAAGAGC CGCTCCGACT GGGAAATCGC GTGCTCTCGC TCAGCATCTG CTATGAAGAC
ATCTTCCCGC TGCACATCAG AAAGCTCATG GCCGGCGGGA AGGGGAGACG GGTTCCCGAG
GCGATGTTCA ACCTCACCAA TGATTCCTGG TACGGCAACT CGATCGAACC GGTGCAGCAT
CTGGCGCTGG CCAGCTTCCG CTCCATCGAG AACCGCCGCT CTCTGGTACG CGTCACCAAT
ACCGGCATAT CCGCATTCGT GGATCCTGCC GGGCGCATCG TCAAGAGTAC AGGCATCTGG
ACCAAGGAGG TCCTGGTGGA CAAGATCCCG CTGTTACAGG GGAGGCGCAC CCCGTATTCG
GTGGCCGGAG ACTGGATCGG CTGGCTCTGT GCCTTGCTCA CTGCATCAGC GATAACTTCG
GCCTATGTCT CGACGCGTCG CAAGAGAGAA GCTGAAAAAG GGTAA
 
Protein sequence
MLNRITSHPL FPWIAAVASG ILFFLGYAGF DQFYLEWIFL VPLFWALRDA RPGRAFLIGW 
VAGIVGHGGG FYWIIEMFKQ FAGAPLPFAL VGLALLAAAN GIVVAAWAWG TRVIAARGWQ
VIWVAPVVWT AMEKFWPEVF PNYLGASQYR LSNLTQIADF AGVLGVSFLV VYINATLYWV
TACWFEEKRL PWRALSALAL SLLFVLGYGE MRLKEVERQV ATAQTLKVGL VQANRGAADL
HIDSDTVLQE HRDMSRLLVE KQRPDLVVWP EGVPVSLSSR EGVLPTAALG DLGVPLLFGA
CLRVADGICN SAFLVDASGR ILGSYDKTVL VPFGEYIPFG DTFPSLYSWS PYSSRFWRGQ
SEEPLRLGNR VLSLSICYED IFPLHIRKLM AGGKGRRVPE AMFNLTNDSW YGNSIEPVQH
LALASFRSIE NRRSLVRVTN TGISAFVDPA GRIVKSTGIW TKEVLVDKIP LLQGRRTPYS
VAGDWIGWLC ALLTASAITS AYVSTRRKRE AEKG