Gene Francci3_0097 details


Gene Information

Locus tag: Francci3_0097
Symbol:
ID: 3902930
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 118976
End bp: 120046
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 73%
IMG OID: 637877427
Product: valine dehydrogenase (NAD)
Protein accession: YP_479220
Protein GI: 86738820
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0334] Glutamate dehydrogenase/leucine dehydrogenase
TIGRFAM ID:
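
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the original record: the function names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence listed in this record ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


print(gene_length(118976, 120046))  # 1071
print(protein_length(1071))         # 356
```

Both results agree with the Gene Length and Protein Length fields in the record.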


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.405788
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTCCC TGTTCAGCGC CGTTTGTGAC CATGAGCAGG TGCTGTTGTG CTCGGACCGT 
CCCTCGGGTC TTCACGCGAT CATCGCCATC TACTCGACGG CGCTGGGGCC GTCGCTGGGC
GGTACCCGCT TCCACCCCTA CGCCGACGAG GAAGTCGCGT TGGCGGACGC CCTCGCCCTG
TCCCGGGCGA TGGCCTACAA GGCCGCCTGC GCCGGGCTGG ACCTCGGCGG CGGCAAGGCC
GTCATCATCG GTGATCCGGC CGTCGCGAAG TCGGAGCCAC TGCTGCGCGC CTTCGGCCGC
CATGTCGCAT CACTGGGCGG CCGATACATC ACCGCTTGCG ACGTCGGTAC CTATGTGGCG
GACATGGACG TCATCGCCCG GGAAACCCGG TGGGTAACCG GACGGTCACC GGCGCACGGC
GGTTCGGGCG ACTCCGGCGT GCTGACCGCC TACGGGGTCT TCGAGGGCAT GCGCGCCGCC
GCCCGGCACC GGTGGGGAAC GCCGAGCCTG GCGGGGCGCC GCGTCGCGGT CTCGGGAGTC
GGCAAGGTCG GGCGGCGGCT CGTCGGGCAT CTCCTCGACA GCGGCGCCTC GGTGGTCGCG
GGCGACGTGG ACCCGGTGGC CCTGGCCCGG CTGCGGGTGG AGTTCCCGGC GGCCGAGACC
GTGCCGGACC CGGACGATCT GCTCGACCTC GACATCGACG TGTACGCCCC CTGCGCGCTG
GGCGGAGCGC TGAGCGCGGA GACCGTCCGC CGGCTGCGAG CCGGCGTCGT CTGCGGCGGC
GCCAACAACC AGCTCGCGCA GCCCGAGGTC GGGCGGCAGC TCGCCGACGC CGGAGTCCTG
TACGCCCCCG ACTTCGTGGT CAACGCCGGC GGCCTGATCC AGGTCGCGGA CGAGATCGAG
GGCTACTCGC CGGAGCGGGC CCGGGCCAGG GCCGCGCAGA TCTTCGACAC CACCTCGGAG
GTGTTCCGCC TCGCCGAGGC CGAGGAGGTG ACCCCGACCG AGGCCGCGGA GCGGCTCGCC
GAACGCCGCA TGACCGACGT GGGACGCCTG CGGGGGATCC TGCTGCCCTG A
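
A GC content figure like the one reported in the Gene Information section can be recomputed from a sequence laid out in blocks of ten bases, as above. The following is a minimal sketch, not the database's own method; the helper names are hypothetical. It strips the whitespace between blocks before counting G and C.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence in this record (20 bases, 12 of them G or C).
print(gc_content("ATGTCGTCCC TGTTCAGCGC"))  # 60.0
```

Passing the full gene sequence text to `gc_content` gives the value for the whole gene.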
 
Protein sequence
MSSLFSAVCD HEQVLLCSDR PSGLHAIIAI YSTALGPSLG GTRFHPYADE EVALADALAL 
SRAMAYKAAC AGLDLGGGKA VIIGDPAVAK SEPLLRAFGR HVASLGGRYI TACDVGTYVA
DMDVIARETR WVTGRSPAHG GSGDSGVLTA YGVFEGMRAA ARHRWGTPSL AGRRVAVSGV
GKVGRRLVGH LLDSGASVVA GDVDPVALAR LRVEFPAAET VPDPDDLLDL DIDVYAPCAL
GGALSAETVR RLRAGVVCGG ANNQLAQPEV GRQLADAGVL YAPDFVVNAG GLIQVADEIE
GYSPERARAR AAQIFDTTSE VFRLAEAEEV TPTEAAERLA ERRMTDVGRL RGILLP
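
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions, not the tool that produced the record: the names are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares. Table 11 also permits alternative start codons, which this sketch does not handle; the gene here begins with ATG, so that case does not arise.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignment, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a CDS, ignoring layout whitespace and dropping a final stop codon."""
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if residues and residues[-1] == "*":
        residues.pop()  # the stop codon is not part of the protein
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTCGTCCC TGTTCAGCGC CGTTTGTGAC"))  # MSSLFSAVCD
```

The output matches the first ten residues of the protein sequence listed above.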