Gene Francci3_3931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3931 
Symbol 
ID3906890 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4704888 
End bp4706336 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content73% 
IMG OID637881258 
Producthypothetical protein 
Protein accessionYP_483010 
Protein GI86742610 
COG category[R] General function prediction only 
COG ID[COG4310] Uncharacterized protein conserved in bacteria with an aminopeptidase-like domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.247655 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATT CCTCCCGCGG CTCCGGCGGT CCCGAGCCGA ACGCTTCCCC CCTGAACGCT 
TCCCCCCCGA ACGGTTCCGA GCCCGCCGAC GCCGGCGAGT ACGCGGCGAC GGCAGGCGCC
GACCCGGGTA CGTGGCTGCA CGACCTGGTC GCCACCCTGC TGCCCCCCAT GCGCAGCATC
ACCGGGGACG GTGTCCGCAC GACGCTCGCC ACCGTCGCGC GGGCGCTCGG CCCGGAGCCC
GCGCTCACGG TGCACGAGGT CCCCAGCGGG ACACCGGTCC TGGACTGGAC CGTGCCCCGG
GAATGGAACG TCGCCTCGGC CCGGTTGACC GGCCCGGACG GCAAGACCGT CGTCGACGCC
GCTGACAACC CGCTGCACCT GCTGGGGTAC AGCACACCGG TCCGCGCCCG GCTGTCCCTC
GACGAGCTGC GCCCGCACCT GTTCTCGATG CCGGACCGCC CGGACTGGGT GCCCTACCGG
ACCTCCTACT ACACCGAGAA CTGGGGCTTC TGCCTGACCG ACCGGCAGCT CGCCGCGCTG
CCCGACGGCG AGTACGACGT GGAGATCGAC ACCACCCTCA CCGCGGGGTC GCTGACCTAC
GGCGAGATCG TGCTGCCCGG GACCACGGAC GACGAGTTCC TCATCACGAC CCACACCTGC
CACCCGGCGA TGGCGAACGA CAACTGCTCG GGCATCGCCA CGGCCACCCT GCTGGCCCGC
ACCCTGGCCG GGCTGCCCCG CCGGCACACC TTCCGGCTGC TGTTCATCCC CGGCACGATC
GGATCGATCA CCTGGCTCGC GCGCAACCGC GACACCGTCG GGCGCATTCG GCACGGGCTG
GTGCTGACCG GCCTGGGCGA CCGGTCGGAC CCGACCTACA AGCGCAGCCG GCGGGGTAAC
GCCGCCGTCG ACCGGGCCGC GGCGGCGGCG CTCGCCGAGA CCGGGCGGCC GCACCGGGTC
GTCGACTTCT CCCCCTACGG CTACGACGAA CGGCAGTTCT GCTCCCCTGG CTTCGACCTG
CCCGTCGGCC GGTTCGGGCG CGGCCAGCAC GGCGACTATC CGCAGTACCA CACGTCCGCG
GACGACCTCG ACTTCGTGAC CCCGCAGTCA CTGGCCGACT CGTTCGCGAT CCTGCTGCGG
ACGATCGACA TCTGCGAGCG CGACCGCATC TGGCGCAACA CCACCCCGTA CGGGGAGCCG
CAGCTCGGCC GGCGGGGCCT GTACCGCGCC ATCGGGGCCA CCATGAACCG CCAGGCGATC
GAGATGGGCC TGCTGTGGGT GCTGAACCTG GCCGACGGCA CGCGCAGCCT GCTCGACATC
GCCGACCGCG CGGACCTGCC GTTCGACACC GTCGCGGCGG CGGCCGATGC CCTGGCGGGC
GTCGATCTAC TCAGCGACGT CACCAGCAGC GACGTCACCA GCGCCGCGCC CGCGGGAGCC
CGGCGGTGA
 
Protein sequence
MSDSSRGSGG PEPNASPLNA SPPNGSEPAD AGEYAATAGA DPGTWLHDLV ATLLPPMRSI 
TGDGVRTTLA TVARALGPEP ALTVHEVPSG TPVLDWTVPR EWNVASARLT GPDGKTVVDA
ADNPLHLLGY STPVRARLSL DELRPHLFSM PDRPDWVPYR TSYYTENWGF CLTDRQLAAL
PDGEYDVEID TTLTAGSLTY GEIVLPGTTD DEFLITTHTC HPAMANDNCS GIATATLLAR
TLAGLPRRHT FRLLFIPGTI GSITWLARNR DTVGRIRHGL VLTGLGDRSD PTYKRSRRGN
AAVDRAAAAA LAETGRPHRV VDFSPYGYDE RQFCSPGFDL PVGRFGRGQH GDYPQYHTSA
DDLDFVTPQS LADSFAILLR TIDICERDRI WRNTTPYGEP QLGRRGLYRA IGATMNRQAI
EMGLLWVLNL ADGTRSLLDI ADRADLPFDT VAAAADALAG VDLLSDVTSS DVTSAAPAGA
RR