Gene Noca_1103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1103 
Symbol 
ID4599356 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1164146 
End bp1166326 
Gene Length2181 bp 
Protein Length726 aa 
Translation table11 
GC content64% 
IMG OID639775699 
Productprotein of unknown function DUF1524 RloF 
Protein accessionYP_922306 
Protein GI119715341 
COG category[S] Function unknown 
COG ID[COG1479] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.509956 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGACGGCA AGATGGCCAT CGTGGAGACC TTCAAGCGAA CGCCGCTCCA GCTGTTCAAC 
CTGCCCCAGC ACTTCGTCAT CCCGCTGTTC CAACGTCCGT ACGTTTGGAA GGAGGACGAG
CAGTGGGAGC CGCTATGGAA GGACATCCGA CGGGTCGCCG AGCTCCGAAT GAATGAGCCG
CACCTGAACC CGCAGCACTT CCTAGGTGCA GTCGTGCTTC AGGCGCACGA CGCCGGCAGT
AACCGGGTCA CCACGTGGAA CGTCATCGAC GGGCAACAGC GGCTGACGAC GCTTCAGGTG
CTCATGGACG CCACCAGCGC GGTTCTCGCC CAGGCCGGCG CCGACCGATA CGCCAGTCAG
CTCGAGTCGC TGACCCACAA CTCGGCGAAC TTCATCCCGG AGGAGGAGAG CGGGCTGAAG
GTCCGGCACC TCAACAACGA CCACGAGGCC TTCGACGAGG TCATGGACGC TGAGTCCCCG
GTGGATTACG CCAGCCTCAA GCACTCCGAG TCCCGGATCG TGAAGGCGCA CCGGTACTTC
ACGACCGCGG TGTCCCAATG GCTCGGCGCC CCGGATGGCG ATGACTTCGG CATCAAGGCC
GAGCAGTTGG CGAGCGTGCT CCAGGCGGAC CTCCAGCTCG TGACGATCGA GCTGCTGTCG
TCGGAGAATT CACAGGAGAT CTTCGAGACG CTGAACGCTC GCGGAACTCC GCTCACCGCT
GCCGATCTCG TTCGCAACTT CGTCTTCCAG CGTCTCGAGG CCGAGGGTGG TGACACCAAG
AAGGCCTACA AGGAGGACTG GCCGTTCGAG ACGAAGTTCT GGACCAGGGA GATCAGCGTC
GGGCGCTATC TCGTGAGCAG GAGCTCGCTG TTCCTCAATC AGTGGCTCAT CGCCAGACTC
GGTGAGGAGA TTGGTCCCCA GTCGACCTTC AGTCGTTTCA AGTCGTACGT CGAGCACGAT
GCCGGCCACA AGATGGCTGA CCTCCTGCCT GTCATCAAGG AGCAGGCGGA ACGGTACGAG
GCCTGGACGG AGGTTGCCGC CAAGCCGAGC GGCAACCTGA GCGTGGTCGA AATGGCTGTC
TACCGCATGC AGGCCAGTGG GGTGGAACTG CTCAAGCCGC TTCTGATCTG GCTGCACGAG
CCAGGCCGGA ACCTTCCGCA AGAGACGATC GAGCGCATCG TCGAGGCGGC CGAGAGCTGG
ATCGTGCGGC GGCAACTCCT GAGGCTGACA GGTTCAGACC TCGGCCGGGT CGTGGCTGAC
ATCATCAAGA GGCACAGCGG CTCGCCGGCC GACGATTTGG CCGATCGGGT CGTCACCTAC
CTGTCTCGCC TCAACGTAGC CAGCACCTAC TGGCCAGGGG ACAACGAGAT CCGTGTCGCG
CTGACGACCG AGTCGGCCTA CCGCCGATTC CCACGTGCCC GGCTGCGCTC CTTCCTGGAG
GCGATCGAGA ACCACTACCG GGCCGAGACC AAGCAGCCAC AGGCTGAGCG TTCTGGCTTC
CCGATCGAGC ACATCCTGCC GCAGAAGTGG CGGGAGAACT GGCCGGTCAG CTCACCCCAA
GAAGAGCAGG AACGTCAGGA ACGTGTCCAC AAGCTGGGCA ATCTGACCCT CCTGACGGGT
CCGCTCAATT CGAAGGTCTC CAACGGTCCT TGGGACACGA AGCGCAGGGC CCTCTTGCAG
CACAACACGA TCAAGCTCAC CGGAAGGCTC CTCGACTGGG TCGGCGACGC CGATTGGTCG
GAAGCACAGA TCGACCAGCG AACAGCAGCG CTCATCGACG TCCTGCTGGA AGTCTGGCCG
GTCCCCGAGG GACACAACGG TCAGGTGGTG GACCCCCAGG CCAAGGCGCA AGACTGGATC
GAGCTGAAGC ATTTGGTCGA TGCGGGCCTG ATTGCACCGG GGGACAAGTT GATCGCCACT
CACCGCGACT TCGCCGGGCG TGAGGCCGAG ATCGGTGACG ACCTCCGCAT CCACTTGGAC
GGCAAGGCCT TCAGCACCCC GTCCGGAGCA GGACAGCACC TTCGCAAGAA GGCGACGAAC
GGCTGGTACT TCTGGGCTCT GGCAGACGGA CGCCGGCTCC GGGACGTTCG GGCGGAGTTC
TTGAGTGCCG CTTCCACCGA CGAGGGACAA CTCGAGCTGA TCGGTGGCGA GCCGGAACCG
GCCCCCGACG TACGGAGTTG A
 
Protein sequence
MDGKMAIVET FKRTPLQLFN LPQHFVIPLF QRPYVWKEDE QWEPLWKDIR RVAELRMNEP 
HLNPQHFLGA VVLQAHDAGS NRVTTWNVID GQQRLTTLQV LMDATSAVLA QAGADRYASQ
LESLTHNSAN FIPEEESGLK VRHLNNDHEA FDEVMDAESP VDYASLKHSE SRIVKAHRYF
TTAVSQWLGA PDGDDFGIKA EQLASVLQAD LQLVTIELLS SENSQEIFET LNARGTPLTA
ADLVRNFVFQ RLEAEGGDTK KAYKEDWPFE TKFWTREISV GRYLVSRSSL FLNQWLIARL
GEEIGPQSTF SRFKSYVEHD AGHKMADLLP VIKEQAERYE AWTEVAAKPS GNLSVVEMAV
YRMQASGVEL LKPLLIWLHE PGRNLPQETI ERIVEAAESW IVRRQLLRLT GSDLGRVVAD
IIKRHSGSPA DDLADRVVTY LSRLNVASTY WPGDNEIRVA LTTESAYRRF PRARLRSFLE
AIENHYRAET KQPQAERSGF PIEHILPQKW RENWPVSSPQ EEQERQERVH KLGNLTLLTG
PLNSKVSNGP WDTKRRALLQ HNTIKLTGRL LDWVGDADWS EAQIDQRTAA LIDVLLEVWP
VPEGHNGQVV DPQAKAQDWI ELKHLVDAGL IAPGDKLIAT HRDFAGREAE IGDDLRIHLD
GKAFSTPSGA GQHLRKKATN GWYFWALADG RRLRDVRAEF LSAASTDEGQ LELIGGEPEP
APDVRS