Gene Noca_3075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3075 
Symbol 
ID4600192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3275296 
End bp3277692 
Gene Length2397 bp 
Protein Length798 aa 
Translation table11 
GC content74% 
IMG OID639777681 
Producttransglutaminase domain-containing protein 
Protein accessionYP_924264 
Protein GI119717299 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.104196 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGCAC TCACCCGGTC CCGCGGCAGC CTCTCGTCGA GCGCCCTCCT CGCGGCGGTG 
GCCGCCACGA CCACGTTGGT CGCGGTGTAC GCCTGGCGGG GCTTCACGCA GGCGCCCGGC
GGCTTCTTGA ACCCCTTGCT CCTGCTGGGA ATCGTGGTCG CGGCGACCGG CACGGCAACC
AGGTGGTGGC GGTGGCCCGG ACCGGCCGTG GTCGCCGCCC AGGTCGTCGT CTCGGGCCTC
GTCGCCAGCA TGCTCATCAC CGGGTCGCTG GTCCCGGGCA CCGGCACCCT TGGCGACGTC
CGGTCGGCGA TCGTCGCGGC GCTGGACAGC TCCCGTGAGT TCGCGGCACC GGTGCCTGCC
ACCGCGCCGC CCATCGACCC GCTGCTGATC CTGTGCGGGC TGGCCTGCCT GCTGCTCGTC
GACCTCCTCG CCTGCACGCT GCGCCGGGCG CCGCTCGCGG GCCTGCCGCT GCTGACGATC
TACACGATCC CGGTGAGCCT GATCGAGACG TCGATCGCCT GGTGGGTCTT CGTCGGCACG
GCGGCCGGCT TCCTCGCGAT GCTCTTCCTC CAGGAGGCCG AGCACGTCAG CCGCTGGGGC
CGACCGATCG CCGAGGACCG CGAGACCGGA GACCCGATCT CGCTCGGAGC CGGCGCGCAC
GCCGTTCGGC GTACCGCCAC CGGCATCGGC GGGGCCGCCA CCGCCCTCGC GATGGTGCTG
CCGCTGCTCG TCCCGACCCT CGGCCTGCAC GTGTTCGACT TCGGGCCCGG CAAGGGCGAC
GGGGACAACA TCCGGATCGA GAACCCGGTG GCGGACCTGG TGCGCGACCT CAAGCGCGGC
GAGGACACCG ACCTGGTGCG GATCACCACG ACCGAGACCA ATCCGGCGTA CCTGCGGATC
CTGGACCTGA ACCGCTTCAC CGACGTCGAG TGGACGCCCG GCGACCGCGA CGTCCCCACC
AACCACGGCG CCGACGGCGC GCTGCCGCCG CCGCAGGGCG TCGACGCCGA GGTGACCCGG
GAAGAAGTGC CCTACGACGT CACCATCCTG CCGGCGTTCG AGTCGCGGTG GCTGCCGACG
CAGTTCCCGG CCAGCAACGT GCAGGCCGAG GGCGACTGGC GCTACGACTC GACCACGATG
GACTTCCTGG CCGTTCCCGG GGACCTGACC ACCGCGGACC TGCACTACAC GATGAGCGCG
CTCGACCTGG TCCTCAGCCC GAAGCGGCTG CGCGCGGCCG GTTCGTCCGT GGGCCAGGTC
AGCGAGATCT TCACGGACCT GCCGCCGAAC CTCCCGCTGG TCGTGCGCCA GCTCGCCGTC
CAGGTGACCC AGGATCAGAC CACGCGGTTC GACAAGGCGG TGGCCCTGCA GAACTGGTTC
CGCAGCGAGT TCACCTACTC CCTGGAGACG CACGCCTCCG GCAACGGGTA CGACGCTCTC
ACCACGTTCC TCGGCGACGG CCCCGACGGC CGGGTGGGGT ACTGCGAGCA GTTCGCCTCG
GCGATGGCCG TGATGGCCCG GGTGCTGGGC ATCCCGGCAC GGGTGGCGGT CGGCTTCCTG
ACACCGGAGC CGGACGGTCC CAACACCTGG GTCTACAGCT CGCACGACAT GCACACCTGG
CCCGAGCTGT TCTTCCGCGG CTCCGGGTGG GTGCGCTTCG AGCCGACCCC GGCCGACCGG
GCGACCGGCG TGCCGTCGTA CACGGTCTCC GGCCTGCCGG GCGGCCTGGA CCCCTCCGAC
CCCGCGGCGA CGCAATCCAG CACCAGCATC CCGGGGCCGT CGAACCGGGC CACCCAGACC
GCGGACCCGA CCGCCGACAC CGGACAGAAC GGCGAGGCCG ACGCCGGCCT CGCGTGGGGA
CCGGTCCTCG GCGGCGGCGC CGGACTGCTG GTCGTGGCCG GCCTCCTGCT CCTGCCCCGG
CTGGTCCGCC GTCGCCGGCG CGCGCGTCGC CTCACGGTGG CGGGACCGGA GGCGATCTGG
GCCGAGCTGC ACGACACCGC ACTGGACCTG GGCGTGCCGT GGCCGGCGGG CCGATCCCCG
CGGGCGACCC GCGACGTCCT CGTGGACCAC CTCGGCCTGC CGGTGGACGC GACCAGTGCC
GAGCGACCCG CCCACGGCGC CGACATCGCG CCCGAGGGGG CGGCGGCGCT GGACCGGATC
GTGCTCGACG TCGAGCGGCT GCGCTATTCG CGCTCCCCCG CCGACGCCGA CCGGCCCCGG
CTCCGTGCCG ACGGCCGGAC CTGCATCGCC TCGCTGACCG GCGGCGCTCC TCGCGCGGCC
CGGCGCCGGG CGACCTGGTG GCCGCGGTCG GTCCTGGCCT TCGTCAGCCG GGCGCCGCGG
GCGGTCGCGC CGACCGTCGA GGCGAGGTAC GGCGGGGTCG TCGACCACGC GAACTGA
 
Protein sequence
MAALTRSRGS LSSSALLAAV AATTTLVAVY AWRGFTQAPG GFLNPLLLLG IVVAATGTAT 
RWWRWPGPAV VAAQVVVSGL VASMLITGSL VPGTGTLGDV RSAIVAALDS SREFAAPVPA
TAPPIDPLLI LCGLACLLLV DLLACTLRRA PLAGLPLLTI YTIPVSLIET SIAWWVFVGT
AAGFLAMLFL QEAEHVSRWG RPIAEDRETG DPISLGAGAH AVRRTATGIG GAATALAMVL
PLLVPTLGLH VFDFGPGKGD GDNIRIENPV ADLVRDLKRG EDTDLVRITT TETNPAYLRI
LDLNRFTDVE WTPGDRDVPT NHGADGALPP PQGVDAEVTR EEVPYDVTIL PAFESRWLPT
QFPASNVQAE GDWRYDSTTM DFLAVPGDLT TADLHYTMSA LDLVLSPKRL RAAGSSVGQV
SEIFTDLPPN LPLVVRQLAV QVTQDQTTRF DKAVALQNWF RSEFTYSLET HASGNGYDAL
TTFLGDGPDG RVGYCEQFAS AMAVMARVLG IPARVAVGFL TPEPDGPNTW VYSSHDMHTW
PELFFRGSGW VRFEPTPADR ATGVPSYTVS GLPGGLDPSD PAATQSSTSI PGPSNRATQT
ADPTADTGQN GEADAGLAWG PVLGGGAGLL VVAGLLLLPR LVRRRRRARR LTVAGPEAIW
AELHDTALDL GVPWPAGRSP RATRDVLVDH LGLPVDATSA ERPAHGADIA PEGAAALDRI
VLDVERLRYS RSPADADRPR LRADGRTCIA SLTGGAPRAA RRRATWWPRS VLAFVSRAPR
AVAPTVEARY GGVVDHAN