Gene Ndas_2797 details


Gene Information

Locus tag: Ndas_2797
Symbol:
ID: 9246648
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3339568
End bp: 3341199
Gene Length: 1632 bp
Protein Length: 543 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: amino acid carrier protein
Protein accession: YP_003680715
Protein GI: 297561741
COG category:
COG ID:
TIGRFAM ID:
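
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic, and a short check can confirm they agree. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are not part of the record.

```python
# Values copied from the record above.
start_bp = 3339568
end_bp = 3341199
reported_gene_length = 1632
reported_protein_length = 543

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# A CDS is a whole number of codons; the final stop codon is not translated.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1632 0 543
```

Under these assumptions the computed values match the reported 1632 bp and 543 aa.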


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.566332
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0592572
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACAACG CAGTCGTCGC CGTCAACGAC TTCTTCTGGA GCTACTTCCT CATCCCGCTC 
CTGATCGTCC TGGCCGTCTA CTTCACGGTG CGCTCGGGGG TCGTGCAGCT GCGCCTGCTG
CCGGAGATGT TCCGCGTCCT GGGCAGCGCC CCGGGGGTGG CGCCCGACGG CCGACGGGAG
ATCTCCTCCT TCCAGGCCTT CTCCATCTCG GCCGCCTCAC GGGTGGGGAC GGGCAACATC
GTCGGCGTGT CCACCGCGAT CATCCTGGGC GGGCCGGGCG CGGTGTTCTG GATGTGGACG
ATGGCCGCGG TGCTGGGCGC CGCGGCGTTC GTGGAGTCCA CCCTCGCCCA GCTGTACAAG
GTGCGCACCA GCACCGGCTT CCGGGGCGGT CCGGCCTACT ACATGGAGAA GGGGCTGGGC
AGGCGGTGGA TGGGCGTGCT GTTCGCGGTG ATCATCACCG TGACCTTCAG CCTGGTGTTC
AACACGGTGC AGGCCAACAG CATCGCCGCC GCCGTGTCCA CCTCCGTGGG CGCCCTCGGC
GGGACACCCG GCCTGCCGCT GTCGGCCGTG ATCGGGCTGG TGCTGGCGGG CCTGACCGCG
CTGGTGATCT TCGGCGGGGT CAGGCGCATC GCGCACGCCG CCCAGGCGCT GGTCCCGCTC
ATGGCCGTCC TGTACATCCT CATCGGCCTG TGGGTGGTCG CCCTCAACAT AGGGGAGCTG
CCCCGGGTCG TCGGCGACAT CTTCGCGTCG GCGTTCGGCG CGCGCGAGTT CGTCGCCGGG
GGAGTGGGCA CGGCCATCGT CCAGGGCATG CGGCGCGGCA TGTTCTCCAA CGAGGCGGGT
CTGGGGTCGG CCCCCAACGC CGGAGCCACC GCCTCGGTCT CCCACCCGGC CAAGCAGGGG
CTCGTGCAGA CGCTGGGCGT GTACTTCGAC ACCTGGCTGG TGTGCTCGGT GACGGCCTTC
ATCGTGCTGG TCAGCGGCCC CGAGTACGGC GACGACCGGG GCATCCAGGT CACGCAGGGC
GCGCTGGAGG CCAGCCTGGG CGGGTGGTCG GTCCACGCGC TCACGATCAT CCTCTTCCTG
CTGGCCTTCA CCTCCGTGCT GGGCAACTAC TACTACGGGG AGACCAACCT CCAGTTCCTG
GACAGCAGCC CCCAGCACAT GACCTACTTC CGCTACGCGG TCCTGGCGGC TGTCTTCCTC
GGCTCCGTGG CGCCCCTGAC CCTGGTGTGG AGCCTGGCCG ACATCTCCAT GGGCGTGATG
GCCACCGTGA ACCTGCTGGC CCTGGCGCCG CTGTCGGGGA TCGCCTTCCG GCTGCTGGCC
GACTACAGCC GCCAGCTGCG CAACGGCCTG GACCCCCAGT TCACGCTCGC CAAGATGCCC
GGCCTGCGCA ACGTGGAGTG CTGGGGCCCG GCCTCCGGGC CCCAGCCCGA GGACGTCCCC
CGCGGGGAGG GCGAGGCGCC CCCGGACCCG GGCGCGGTCG CGGAGGAGCG GGGGCGGGAC
GGCGAGCGGT CCGACCCGGG GGAGGCGGAG TCGCACGGCG CGACGGCCGT GGACGAGGCC
GCTGACGCGG ACCGCTCCGA CGGTGACGAC GGCCCCGAGG AGGGGGACCG GGGGTCCGAC
ACCGAGCGAT GA
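
The GC content field of this record (72%) can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over the whole sequence; `gc_percent` is a hypothetical helper name, and the example runs only on the first 10-base block, so it does not assert the value for the full gene.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)

# First 10-base block of the gene sequence (3 G and 2 C out of 10 bases).
first_block = "GTGAACAACG"
print(gc_percent(first_block))  # 50.0
```

Passing the full sequence, with its spaces and line breaks left in, should give a figure that can be compared with the reported value.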
 
Protein sequence
MNNAVVAVND FFWSYFLIPL LIVLAVYFTV RSGVVQLRLL PEMFRVLGSA PGVAPDGRRE 
ISSFQAFSIS AASRVGTGNI VGVSTAIILG GPGAVFWMWT MAAVLGAAAF VESTLAQLYK
VRTSTGFRGG PAYYMEKGLG RRWMGVLFAV IITVTFSLVF NTVQANSIAA AVSTSVGALG
GTPGLPLSAV IGLVLAGLTA LVIFGGVRRI AHAAQALVPL MAVLYILIGL WVVALNIGEL
PRVVGDIFAS AFGAREFVAG GVGTAIVQGM RRGMFSNEAG LGSAPNAGAT ASVSHPAKQG
LVQTLGVYFD TWLVCSVTAF IVLVSGPEYG DDRGIQVTQG ALEASLGGWS VHALTIILFL
LAFTSVLGNY YYGETNLQFL DSSPQHMTYF RYAVLAAVFL GSVAPLTLVW SLADISMGVM
ATVNLLALAP LSGIAFRLLA DYSRQLRNGL DPQFTLAKMP GLRNVECWGP ASGPQPEDVP
RGEGEAPPDP GAVAEERGRD GERSDPGEAE SHGATAVDEA ADADRSDGDD GPEEGDRGSD
TER
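
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, where GTG is an accepted start codon and an initiating codon is read as methionine. The sketch below illustrates that rule under stated assumptions: the codon assignments are those of the standard code (which table 11 shares), the start codon set is the one listed by NCBI for table 11, and `translate_cds` is a hypothetical helper, not the tool used to produce this record.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11 (assumption stated above).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at a stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate_cds("GTGAACAACG CAGTCGTCGC CGTCAACGAC"))  # MNNAVVAVND
```

The output matches the first block of the protein sequence. The same function can be applied to the full gene sequence to compare against the listed protein.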