Gene Ndas_2332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2332 
Symbol 
ID9246182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2782216 
End bp2783466 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content77% 
IMG OID 
Producttype III effector Hrp-dependent outers 
Protein accessionYP_003680260 
Protein GI297561286 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.411817 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACCC TCGGCGTGAT CGCCGACGAC CTCACCGGAG CGACCGACGT CGCGATCGCC 
CTCACCGCGT CCGGCCACCG GACCACGGTG GTCCTGGACT CGCGGGACCC CGGCGGCGCC
GACCCCGTCG CCGCCGCGGC GGAGGGGGCC GACGCCGTCG TCGTGGCCCT GAAGTCGCGC
ACCACGCCCG CCGACGCGGC GGTCGCGGCC TCGCTGGACG CCCTCGACCG GCTGCGGAGC
GCGGGGTGCG AGCGGTTCTA CCTCAAGTAC TGCTCCACCT TCGACTCCAC CCCGGACGGC
AACATCGGCC CGGTCGCGGA GGCGGTGCTC GACGCCCTCG GCGAGGACGT CACGGTCGTC
GCACCGGCCT TCCCCGCCAA CGGGCGCACC GTCTACCGCG GCCACCTCTT CGTGGGGGAC
GACCCCCTCG ACGAGTCCCC GATGCGCCAC CACCCGCTCA CGCCCATGAC CGACTCCAGC
CTCCCGCGCC TGCTCGCCCC CCAGGTCAGC GGCGGCGGGG ACGCCATCGC GCTGGTGCCC
TGGCCGGTGG TCGCCCGGGG GGCCGAGGCG GTGCGCGACG CCATCGCGCG GGCCGGTGCG
CAGGGGGCCC GGTTCGTGGT CGTGGACGCC CTGACCGACG CCGACCTGCG CACGCTCGCC
GACGCGACGC GGGACCTGCG CCTGCTGACC GGCGGCAGCG CCCTCGCCCA GGGGCTCACC
GGCCCCCACG GGACCGGCCG CCTGCCGCTC ACCCCGCCCC GGGGGCCGCG CGTCGTCCTC
TCGGGCAGCG CCTCGCGGGC GACCCAGGGC CAGGTGCGCC ACGCCCTCGC CCACGGCGGG
GGACACCACC TGCTCCCCTC CGACCTGCGC CGGGACTTCG GGGCGACGGT GTCGCTGGCC
GTGGAGCGCG CCCTGGAGGG CGGCGCGTCC CCGTTCGTGG TGTACGCGAC CGCCGCACCC
GAACACGTCG TGGACACCGC CGACGCGCCG CTCATCGAGG AGGCTCTGGC CGAGATCGCC
GCACGCCTGG TCGCCGCGGG GGCGCGCGCG CTCCTGGTCG CGGGCGGCGA GACGAGCGGA
GCCGTCGTGC GGCGCCTGGG CGTGGCGTCG CTGGCGCTGG GACCCGAGAT CGACCCGGGC
GTCGCGTGGA CGCTCGGACA CAGCGACGGT GAGGACGTCC AGCTCATGCT CAAGTCCGGC
AACTTCGGCC GCGAGGACCT GTTCGTCCGC GCCTGGGAGG GGGACAAGTG A
 
Protein sequence
MATLGVIADD LTGATDVAIA LTASGHRTTV VLDSRDPGGA DPVAAAAEGA DAVVVALKSR 
TTPADAAVAA SLDALDRLRS AGCERFYLKY CSTFDSTPDG NIGPVAEAVL DALGEDVTVV
APAFPANGRT VYRGHLFVGD DPLDESPMRH HPLTPMTDSS LPRLLAPQVS GGGDAIALVP
WPVVARGAEA VRDAIARAGA QGARFVVVDA LTDADLRTLA DATRDLRLLT GGSALAQGLT
GPHGTGRLPL TPPRGPRVVL SGSASRATQG QVRHALAHGG GHHLLPSDLR RDFGATVSLA
VERALEGGAS PFVVYATAAP EHVVDTADAP LIEEALAEIA ARLVAAGARA LLVAGGETSG
AVVRRLGVAS LALGPEIDPG VAWTLGHSDG EDVQLMLKSG NFGREDLFVR AWEGDK