Gene Clim_0679 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0679 
Symbol 
ID6354293 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp750787 
End bp752142 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content54% 
IMG OID642668306 
ProductNitrogenase 
Protein accessionYP_001942741 
Protein GI189346212 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0432483 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACATG CGAAAACCGC AACCCAGAAC GCCTGCAAGC TTTGCAATCC ATTGGGGGCA 
TGCCTGGCAT TCAGGGGCAT AGAAAAATGC GTACCCTTCC TGCACGGGTC ACAGGGATGC
GCAACCTATA TCAGAAGATA CCTGATCAGC CACTACAAGG AACCGATCGA CATTGCCTCT
TCGAACTTCA ACGAGGAAAC CGCCGTGTTC GGAGGCAGCC ACAATCTGCA GCTCGGCCTT
AAAAACGTAA CCCAGCAGTA CAAACCGGAA GTTATCGGTC TGGCAACCAC CTGTCTGAGT
GAAACCATCG GTGATGACGT GCCCATGATT CTGCGCGAAT ATAAAAAGGC ATTCAGGAAC
GGCTCCCCCA TGCCGGTCAT GATACATGCC TCGACGCCAA GTTATCAGGG CAGTCACATC
GACGGGTTTC ACGCAGCCGT GCGCGCCACC GTTGCAACGC TTGCGGTTAA AGACGCCGAA
AGGCAGCATA CGGTGAACAT ATTTCCGAAT ATGATTTCTC CTGCTGATAT CCGTTACATC
AAGGAAATTC TCGAAGCCTT CCGGCTTCCC TATATGCTGC TGCCTGACTA TTCGAAAACC
CTTGACGGTG GCCCCTGGGG CGAATACCAC AGAATCCCTC CCGGAGGCAC ACCGGCAGGC
TCCATAGCCT CAGCAGGTTC CGCATCGGCA AGCATAGAAT TCGGAGCAAC GCTTGAAGCC
CCGAAATCCG CGGCGGGACA TCTTGAATCG GCATTCGGAG TCACCCGCCA CCATATGGGA
CTTCCGATCG GCGTCAAGGC AAGCGACCGG TTTTTTGCGC TGCTCGAAGA GCTGAGCGGA
CAACCGGTAC CCGAAAAATA CGAAGACGAA CGCCGGCGTC TCATCGACGC CTATGCCGAT
GGTCACAAGT ACATTTTCGA AAAAAGGGCG ATCGTATACG GTGAAGAGGA TCTGGTCATT
GCCATGACCG CGTTTCTGCT GGAGATCGGC ATAACCCCGG TACTTTGCGC TTCCGGGGGA
AAAAGCGGCC TGCTGAAGAA AAAAATCCGG GAGCTTGTGC CCGACCTCGA TGAAGGAGAG
ATCAAAATCC GCGACGGCGT GGACTTCGTC GATATCGAAG ATGATGCCAA GGTGCTCAAA
CCGGATTTTC TGATCGGCAA CAGCAAAGGC TACACCATGT CGAGAAAAAA CAACATTCCG
CTTCTGAGAA TCGGTTTTCC CATTCACGAC CGTTTCGGAG GACAGCGGCT GCACCACCTC
GGATACCGGG GCACCCAGGA GCTGTTCGAC AGGATCGTCA ATATGGTTAT CGAAGAGCGG
CAGAATGCTT CATCAATCGG TTATACGTAC ATGTAA
 
Protein sequence
MKHAKTATQN ACKLCNPLGA CLAFRGIEKC VPFLHGSQGC ATYIRRYLIS HYKEPIDIAS 
SNFNEETAVF GGSHNLQLGL KNVTQQYKPE VIGLATTCLS ETIGDDVPMI LREYKKAFRN
GSPMPVMIHA STPSYQGSHI DGFHAAVRAT VATLAVKDAE RQHTVNIFPN MISPADIRYI
KEILEAFRLP YMLLPDYSKT LDGGPWGEYH RIPPGGTPAG SIASAGSASA SIEFGATLEA
PKSAAGHLES AFGVTRHHMG LPIGVKASDR FFALLEELSG QPVPEKYEDE RRRLIDAYAD
GHKYIFEKRA IVYGEEDLVI AMTAFLLEIG ITPVLCASGG KSGLLKKKIR ELVPDLDEGE
IKIRDGVDFV DIEDDAKVLK PDFLIGNSKG YTMSRKNNIP LLRIGFPIHD RFGGQRLHHL
GYRGTQELFD RIVNMVIEER QNASSIGYTY M